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(54) Title: LENTIVIRAL VECTORS, RELATED REAGENTS, AND METHODS OF USE THEREOF 



(57) Abstract: The present invention provides new lentiviral vectors, including lentiviral transfer plasmids and infectious lentiviral 
particles. Lentiviral vectors of the invention were designed to offer a number of desirable features including reduced size, conve- 
nient cloning sites (including multiple cloning sites and sites for particularly useful restriction enzymes), loxP sites, self-inactivating 
LTRs, etc. Certain of the vectors are optimized for expression of reporter genes and/or for expression of siRNAs or shRNAs within 
eukaryotic cells. The invention also provides three and four plasmid lentiviral expression systems. In addition, the invention provides 
a variety of methods for using the vectors including gene silencing in cells and transgenic animals, and methods of treating disease. 
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LENTIVIRAL VECTORS, RELATED REAGENTS, 
AND METHODS OF USE THEREOF 

Cross-Reference to Related Applications 
5 [0001] This application claims priority to U.S. Provisional Patent Applications 
Ser. No. 60/408,558, filed September 6, 2002, Ser. No. 60/414,195, filed September 
27, 2002, and Ser. No. 60/428,039, filed November 21, 2002. The contents of each of 
these applications is incorporated herein by reference. 

1 0 Background of the Invention 

[0002] Viral vectors are efficient gene delivery tools in eukaryotic cells. Useful 
viral vectors have been created from different virus families, including retroviruses. 
Retroviruses have proven to be versatile and effective gene transfer vectors for a 
variety of applications since they are easy to manipulate, typically do not induce a 

15 strong anti- viral immune response, and are able to integrate into the genome of a host 
cell, leading to stable gene expression. If provided with an appropriate envelope, 
retroviruses can infect almost any type of cell. Due to these advantages a large 
number of retroviral vectors have been developed for in vitro gene transfer. In 
addition, use of retroviruses for purposes, such as the creation of transgenic or 

20 knockout animals, or for gene therapy, has been explored. 

[0003] However, vectors based on simple retroviruses (e.g., oncoretroviruses) 
have a number of disadvantages that limit their efficacy for such in vivo applications. 
For example, vectors based on simple retroviruses are generally unable to integrate 
into the genome of nondividing (postmitotic) cells. Furthermore, transgenes 

25 expressed from simple retroviruses are subject to silencing during development (22). 
To overcome these drawbacks, attention has recently focused on lentiviruses, a group 
of complex retroviruses that includes the human immunodeficiency virus (HIV). In 
addition to the major retroviral genes gag, pol, and env, lentiviruses typically include 
additional genes that play regulatory or structural roles. Unlike simple retroviruses, 

30 lentiviruses are able to integrate into the genome of non-dividing cells. Accordingly a 
variety of lentiviral vectors have been developed. However, existing lentiviral vectors 
remain less than optimal from a number of perspectives. For example, existing 
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lentiviral vectors are typically large in size, poorly characterized, and lack various 
features that facilitate cloning and uses of the vectors. Thus there remains a need in 
the art for improved lentiviral vectors. The present invention addresses this need. 
[0004] Rapid progress in technologies for sequencing genes and characterizing 
5 their expression profiles has resulted in a growing list of coding regions within 

mammalian genomes that are predicted to contribute to normal tissue function and to 
the development of disease. Traditionally, establishing gene function has been 
accomplished by gene targeting in mouse embryonic stem cells. While this 
technology has been responsible for many key breakthroughs in our understanding of 

10 the normal function as well as diseases of organs and tissues, it remains time- 
consuming and expensive to perform. Furthermore, current gene targeting approaches 
cannot be used to alter gene function in human tissues for the purposes of scientific 
investigation or gene therapy. For these reasons, alternative approaches to inhibit gene 
activity in primary cells and tissues have been explored. 

15 [0005] Among the most promising of these new approaches is RNA interference 
(RNAi), which has recently emerged as a rapid and efficient means to silence gene 
function in eukaryotic (including mammalian) cells. As initiallyy described in the 
nematode C. elegans, RNAi involves introduction of double-stranded RNA (dsRNA) 
into a cell thereby inhibiting gene expression in a sequence dependent fashion. More 

20 recently it has been shown that shorter dsRNA species known as short interfering 
RNAs (siRNA) can silence mammalian gene expression in a specific manner, 
suggesting that RNAi can be used to study and manipulate gene function in higher 
organisms as well. However, the use of RNAi in mammalian cells and organisms is 
currently restricted by the limited delivery methods available. Accordingly, there is a 

25 need in the art for improved reagents and methods that would facilitate the use of 

RNAi in mammalian cells and organisms. The present invention addresses this need, 
among others. 

Summary of the Invention 

30 [0006] The present invention provides novel lentiviral vectors that offer a number 
of features and advantages. In one aspect, the invention provides a lentiviral vector 
comprising the following elements: a nucleic acid whose sequence includes (i) a 
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functional packaging signal; (ii) a multiple cloning site (MCS); and (iii) at least one 
additional element selected from the group consisting of: a second MCS, a second 
MCS into which a heterologous nucleic acid is inserted, a human immunodeficiency 
(HIV) FLAP element, an expression-enhancing posttranscriptional regulatory 
5 element, a target site for a site-specific recombinase, and a self-inactivating (SIN) 
long terminal repeat (LTR). The lentiviral vector may be a lentiviral transfer plasmid 
or an infectious lentiviral particle. In various embodiments of the invention the 
expression-enhancing posttranscriptional regulatory element is a woodchuck hepatitis 
virus regulatory element (WRE), and/or the target site is a loxP site. The invention 
1 0 further provides collections of lentiviral plasmids possessing the features described 
above. 

[0007] In other aspects, the invention provides cells, including mammalian cells, 
and transgenic animals that contain any of the inventive lentiviral vectors or 
proviruses derived therefrom. The invention further provides methods for making 
15 transgenic animals the cells of which Comprise an inventive lentiviral vector or a 
provirus derived therefrom. 

[0008] The invention further provides a variety of lentiviral expression systems 
comprising inventive lentiviral transfer plasmids. For example, the invention 
provides a three-plasmid lentiviral expression system comprising: (a) a first plasmid 

20 whose sequence comprises a nucleic acid sequence of at least part of a lentiviral 
genome, wherein the plasmid (i) contains at least one defect in at least one gene 
encoding a lentiviral structural protein, and (ii) lacks a functional packaging signal; 
(b) a second plasmid whose sequence comprises a nucleic acid sequence of a virus, 
wherein the plasmid (i) expresses a viral envelope protein, and (ii) lacks a functional 

25 packaging signal; and (c) a third plasmid whose nucleic acid sequence includes (i) a 
functional packaging signal; (ii) a multiple cloning site (MCS); and (iii) at least one 
additional element selected from the group consisting of: a second MCS, a second 
MCS into which a heterologous nucleic acid is inserted, an HIV FLAP element, an 
expression-enhancing posttranscriptional regulatory element, a target site for a site- 

30 specific recombinase, and a self-inactivating (SIN) LTR. 
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[0009] The invention further provides a four plasmid lentiviral expression system, 
in which three of the plasmids are as described immediately above and the fourth 
plasmid encodes the Rev protein. 

[0010] The invention provides methods of creating infectious lentiviral particles 
5 and of creating producer cell lines that produce infectious lentiviral particles. The 
lentiviral particles may, but need not be, derived from the lentiviral transfer plasmids 
as described herein. 

[0011] The invention further provides a method for introducing and expressing a 
heterologous nucleic acid in a target cell comprising introducing a lentiviral vector of 
10 the invention into the target cell and expressing the heterologous nucleic acid therein. 
In various embodiments of the invention the heterologous nucleic acid is operably 
linked to a constitutive, an inducible, or a cell type or tissue specific promoter, 
allowing conditional expression of the nucleic acid. 

[0012] In another aspect, the invention provides a method for achieving controlled 
15 expression of a heterologous nucleic acid in a cell comprising steps of: (i) inserting 
the heterologous nucleic acid into a lentiviral vector between sites for a recombinase, 
thereby producing a modified lentiviral vector; (ii) introducing the modified lentiviral 
vector or a portion thereof including at least the sites for the recombinase and the 
region between the sites into the cell and; (iii) subsequently inducing expression of 
20 the recombinase within the cell, thereby preventing expression of the heterologous 
nucleic acid within the cell. 

[0013] The invention also provides a method for expressing a transcript in a 
mammal in a cell type or tissue-specific manner comprising: (i) delivering a lentiviral 
vector to cells of the mammal, wherein the lentiviral vector comprises a heterologous 
25 nucleic acid, and wherein the heterologous nucleic acid is located between sites for a 
site-specific recombinase; and (ii) inducing expression of the site-specific 
recombinase in a subset of the cells of the mammal, thereby preventing synthesis of 
the transcript within those cells. 

[0014] In another aspect, the invention provides a lentiviral vector whose 
30 presence within a cell results in transcription of one or more ribonucleic acids (RNAs) 
that self-hybridize or hybridize to each other to form a short hairpin RNA (shRNA) or 
short interfering RNA (siRNA) that inhibits expression of at least one target transcript 
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in the cell. In certain embodiments of the invention the lentiviral vector comprises a 
nucleic acid segment operably linked to a promoter, so that transcription from the 
promoter (i.e., transcription directed by the promoter) results in synthesis of an RNA 
comprising complementary regions that hybridize to form an shRNA targeted to the 
5 target transcript. (When an RNA comprises complementary regions that hybridize 
with each other, the RNA will be said to self-hybridize.) According to certain 
embodiments of the invention the shRNA comprises a base-paired region 
approximately 19 nucleotides long. According to certain embodiments of the 
invention the RNA may comprise more than 2 complementary regions, so that self- 
10 hybridization results in multiple base-paired regions, separated by loops or single- 
stranded regions. The base-paired regions may have identical or different sequences 
and thus may be targeted to the same or different regions of a single transcript or to 
different transcripts. 

[0015] In certain embodiments of the invention the lentiviral vector comprises a 
15 nucleic acid segment flanked by two promoters in opposite orientation, wherein the 
promoters are operably linked to the nucleic acid segment, so that transcription from 
the promoters results in synthesis of two complementary RNAs that hybridize with 
each other to form an siRNA targeted to the target transcript. According to certain 
embodiments of the invention the siRNA comprises a base-paired region 
20 approximately 19 nucleotides long. In certain embodiments of the invention the 
lentiviral vector comprises at least two promoters and at least two nucleic acid 
segments, wherein each promoter is operably linked to a nucleic acid segment, so that 
transcription from the promoters results in synthesis of two complementary RNAs 
that hybridize with each other to form an siRNA targeted to the target transcript. The 
25 nucleic acid segment(s) present within the lentiviral vectors may be part of a larger 
nucleic acid, e.g., a heterologous nucleic acid that is inserted into the vector as 
described herein. 

[0016] The lentiviral vectors of the invention may be lentiviral transfer plasmids 
or infectious lentiviral particles (e.g., a lentivirus or pseudotyped lentivirus). As 
30 discussed further below, lentiviruses have an RNA genome. Therefore, where the 
lentiviral vector is a lentiviral particle, the viral genome must undergo reverse 
transcription and second strand synthesis to produce DNA capable of directing RNA 
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transcription. In addition, where reference is made herein to elements such as cloning 
sites, promoters, regulatory elements, etc., it is to be understood that the sequences of 
these elements are present in RNA form in the lentiviral particles of the invention and 
are present in DNA form in the lentiviral transfer plasmids of the invention. 
5 Furthermore, where a template for synthesis of an RNA is "provided by" RNA 
present in a lentiviral particle, it is understood that the RNA must undergo reverse 
transcription and second strand synthesis to produce DNA that can serve as a template 
for synthesis of RNA (transcription). 

[0017] The invention further provides pharmaceutical compositions comprising 

10 any of the inventive lentiviral vectors and a pharmaceutically acceptable carrier. 

[0018] The invention further provides a three plasmid lentiviral expression system 
comprising (i) a lentiviral transfer plasmid, wherein the lentiviral transfer plasmid 
directs transcription of at least one ribonucleic acid (RNA) that, when present within a 
cell, hybridizes to form an shRNA or siRNA that inhibits expression of at least one 

15 gene expressed in the cell, (ii) a packaging plasmid; and (iii) an Env-coding plasmid. 
In certain embodiments of the invention the lentiviral transfer plasmid comprises a 
nucleic acid segment operably linked to a promoter, so that transcription from the 
promoter results in synthesis of an RNA that hybridizes to form an shRNA targeted to 
a target transcript. In certain embodiments of the invention the lentiviral transfer 

20 plasmid comprises a nucleic acid segment flanked by two oppositely directed 

promoters, wherein the promoters are operably linked to the nucleic acid segment, so 
that transcription from the promoters results in synthesis of two complementary 
RNAs that hybridize with each other to form an siRNA targeted to a target transcript. 
In certain embodiments of the invention the lentiviral transfer plasmid comprises two 

25 promoters and two nucleic acid segments, wherein each promoter is operably linked 
to a nucleic acid segment, so that transcription from the promoters results in synthesis 
of two complementary RNAs that hybridize with each other to form an siRNA 
targeted to a target transcript. The lentiviral transfer plasmid may, but need not be, 
any of the inventive lentiviral transfer plasmids described herein. 

30 [0019] The invention further provides a four plasmid lentiviral expression system 
comprising a three plasmid lentiviral expression system as described above and a 
fourth plasmid that encodes the Rev protein. 
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[0020] The invention additionally provides a method of inhibiting or reducing the 
expression of a target transcript in a cell comprising delivering a lentiviral vector to 
the cell, wherein presence of the lentiviral vector within the cell results in synthesis of 
one or more RNAs that self-hybridize or hybridize with each other to form an shRNA 
5 or siRNA that inhibits expression of the target transcript. Note that where presence of 
the lentiviral vectors, particles, or plasmids of the invention results in production of an 
shRNA, the shRNA may require further processing within the cell to form an 
inhibitory structure. shRNAs that are so processed are considered to inhibit expression 
of the target transcript. 

1 0 [0021] The invention further provides a method for reversibly inhibiting or 
reducing expression of a target transcript in a cell comprising: (i) delivering a 
lentiviral vector to the cell, wherein presence of the lentiviral vector within the cell 
results in synthesis of one or more RNAs that self-hybridize or hybridize with each 
other to form an shRNA or siRNA that inhibits expression of the target transcript, 

15 wherein the lentiviral vector comprises a nucleic acid segment located between sites 
for a site-specific recombinase, which nucleic acid segment provides a template for 
transcription of the one or more RNAs; and (ii) inducing expression of the site- 
specific recombinase within the cell, thereby preventing synthesis of at least one of 
the RNAs. The vector can be a lentiviral transfer plasmid or lentiviral particle. 

20 [0022] The invention also provides a method for reversibly inhibiting or reducing 
expression of a transcript in a mammal in a cell type or tissue-specific manner 
comprising: (i) delivering to the mammal a lentiviral vector whose presence within a 
cell results in synthesis of one or more RNAs that self-hybridize or hybridize with 
each other to form an shRNA or siRNA that inhibits expression of the target 

25 transcript, wherein the lentiviral vector comprises a nucleic acid segment located 

between sites for a site-specific recombinase, which nucleic acid segment provides a 
template for transcription of the RNA; and (ii) inducing expression of the site-specific 
recombinase in a subset of the cells of the mammal, thereby preventing synthesis of at 
least one of the RNAs within the subset of cells. In any of the above methods, the cell 

30 may be a mammalian cell, the site-specific recombinase may be Cre, and the sites 
maybe loxP sites. 
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[0023] The invention includes a variety of therapeutic applications for the 
inventive lentiviral vectors. In particular, the lentiviral vectors are useful for gene 
therapy. The invention provides a method of treating or preventing infection by an 
infectious agent, the method comprising the step of administering to a subject prior to, 
5 simultaneously with, or after exposure of the subject to the infectious agent, a 
composition comprising an effective amount of a lentiviral vector, wherein the 
lentiviral vector directs transcription of at least one RNA that hybridizes to form an 
shRNA or siRNA that is targeted to a transcript produced during infection by the 
infectious agent, which transcript is characterized in that reduction in levels of the 
10 transcript delays, prevents, or inhibits one or more aspects of infection by or 
replication of the infectious agent. 

[0024] In addition, the invention provides a method of treating or preventing a 
disease or clinical condition, the method comprising: (i) removing a population of 
cells from a subject at risk of or suffering from disease or clinical condition; (ii) 

15 engineering or manipulating the cells to contain an effective amount of an siRNA or 
shRNA targeted to a transcript by infecting or transfecting the cells with a lentiviral 
vector, wherein the transcript is characterized in that its degradation delays, prevents, 
or inhibits one or more aspects of the disease or clinical condition; (iii) and returning 
at least a portion of the cells to the subject. Suitable lentiviral vectors are described 

20 herein. Without intending to suggest any limitation, the therapeutic approaches may 
find particular use in diseases such as cancer, in which a mutation in a cellular gene is 
responsible for or contributes to the pathogenesis of the disease, and in which specific 
inhibition of the target transcript bearing the mutation may be achieved by expressing 
an siRNA or shRNA targeted to the target transcript within the cells, without 

25 interfering with expression of the normal allele. According to certain embodiments of 
the invention, rather than removing cells from the body of a subject, infecting or 
transfecting them in tissue culture and then returning them to the subject, inventive 
lentiviral vectors or lenti viruses are delivered directly to the subject. 
[0025] This application refers to various patents, journal articles, and other 

30 publications, all of which are incorporated herein by reference, ha addition, the 

following publications are incorporated herein by reference: Current Protocols in 
Molecular Biology, Current Protocols in Immunology, Current Protocols in Protein 
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Science, and Current Protocols in Cell Biology, John Wiley & Sons, N.Y., edition as 
of July 2002; Sambrook, Russell, and Sambrook, Molecular Cloning: A Laboratory 
Manual, 3 rd ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 2001. 



5 Brief Description of the Drawing 

[0026] Figure 1 shows a map of pBFGW. 

[0027] Figure 2 shows a map of pLL3.0. 

[0028] Figure 3 shows a map of pLL3 . 1 . 

[0029] Figure 4 shows a map of pLL3.2. 
1 0 [0030] Figure 5 shows a map of pLL3.3. 

[0031] Figure 6 shows a map of pLL3.4. 

[0032] Figure 7 shows a map of pLL3.5. 

[0033] Figure 8 shows a map of pLL3.6. 

[0034] Figure 9 shows a map of pLL3.7. 
1 5 [0035] Figure 1 OA shows schematic diagrams of the HIV pro virus (upper panel) 

and relevant portions of representative packaging and Env-coding plasmids (middle 

and lower panels, respectively) for a three plasmid system. 

[0036] Figure 10B shows schematic diagrams of the HIV provirus (upper panel) 
and relevant portions of representative packaging, Rev-coding and Env-coding 
20 plasmids (second, third, and lower panels, respectively) for a four plasmid system. 

[0037] Figure 11 shows the siRNA structure found to be active in the Drosophila 
system. 

[0038] Figure 12 presents a schematic representation of the steps involved in 
RNA interference in Drosophila. 
25 [0039] Figure 13 shows a schematic diagram of a variety of exemplary shRNA 
structures useful in accordance with the present invention. 

[0040] Figure 14 presents a representation of an alternative inhibitory pathway, in 
which the DICER enzyme cleaves a substrate having a base mismatch in the stem to 
generate an inhibitory product that binds to the 3' UTR of a target transcript and 
30 inhibits translation. 
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[0041] Figure 15 presents a schematic diagram of a nucleic acid that serves as a 
template for transcription of an RNA that hybridizes to form an shRNA and also 
shows the RNA before and after hybridization. 

[0042] Figure 16 presents a schematic diagram of one example of a construct that 
5 may be used to direct transcription of sense and antisense strands of an siRNA. 
[0043] Figure 1 7 A presents a schematic representation of a portion of the 
lentivirus vector pLL3. 7. Key: SIN-LTR: self-inactivating long terminal repeat; *F: 
HIV packaging signal; cPPT: central polypurine track; U6: U6 (RNA polymerase III) 
promoter; MCS: multiple cloning site; CMV: cytomegalovirus (RNA polymerase II) 
10 promoter; EGFP: enhanced green fluorescent protein; WRE: woodchuck hepatitis 
virus response element. 

[0044] Figure 1 7B presents the sequence of the CD8 stem loop used to generate 
pLL3.7 CD8 (See Examples). A sequence known to silence CD8 as an siRNA (1 1) 
was adapted with a loop sequence from Paddison et al. (39) to create the final 

15 sequence. The presumed transcription initiation site is indicated by a +1 . Nucleotides 
which form the loop structure are indicated in green font (Loop). The pol III 
terminator stretch (a stretch of Us in the RNA) is indicated in red font. 
[0045] Figure 1 7C shows the predicted structure of the CD8 stem-loop RNA 
produced from pLL3.7 CD8. 

20 [0046] Figure 18A shows density plots demonstrating specific silencing of CD 8 
expression by pLL3.7 CDS. CD8 + CD4 + E10 cells were either mock infected (No 
Virus), infected with a pLL3.7 (Control Virus), or pLL3.7 CD 8 (CD8 RNAi virus). 
Density plots indicate the expression levels of CD4 and CD8 48 hours post-infection. 
[0047] Figure 18B presents histograms showing staining for the T cell surface 

25 markers, CD3, TCRp, and CD28. The histograms show that other surface markers 
are unaffected by silencing of CD8. E10 cells infected with pLL3.7 (green 
histograms) or pLL3.7 CD8 (pink histograms) were stained for CD3, TCRp, and 
CD28. Solid histograms represent the level of these surface markers on uninfected 
cells. 

30 [0048] Figure 19 A shows stable silencing of CD8 by pLL3.7 CD8. Sorted 

populations of infected E10 cells were maintained in long-term culture. E10 cells 
pLL3.7 CD8 (CD8 RNAi virus) were sorted four days after infection for GFP 
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expression and low CDS expression, while cells infected with control virus were 
sorted for GFP expression only. Each population was cultured for 1 month and 
analyzed for CD8 expression via flow cytometry at weekly intervals. The CD 8 and 
GFP levels expressed by infected cells 4 days following infection and after one month 
5 of culture are shown. 

[0049] Figure 19B shows a Northern blot showing specific degradation of CD8 
mRNA induced by pLL3.7 CD8. CD8 and CD4 mRNA levels in uninfected E10 cells, 
or E10 cells infected with either pLL3.7 (Control Virus) or pLL3.7 CD 8 (CDS RNAi 
Virus) and sorted on the basis of GFP and CD8 expression, were assayed. The bands 

10 representing CD8 and CD4 mRNA species are identified by lines (top panel). 

[0050] Figure 19C shows generation of processed shRNAs in cells infected with 
pLL3.7 CD8. The cells analysed for CD8 and CD4 mRNA levels described in the 
legend to Figure 18B were also assayed for the presence of shRNAs by Northern blot. 
The location of 21, 22, and 23 nucleotide RNAs are identified by arrows. 

15 [0051] Figure 20A presents flow cytometric analysis showing specific silencing 
of genes in primary T cells by pLL3.7 CD 8 and pLL3.7 CD25. CD8 + TCR transgenic 
T cells were activated for 3 days with cognate peptide and then infected with pLL3.7, 
pLL3.7 CD8, or pLL3.7 CD25. The efficiency of infection was determined by 
assaying GFP expression by flow cytometry. The expression of CD 8 and CD25 on 

20 infected T cells was assayed by staining with specific antibodies that bind these 
surface markers. 

[0052] Figure 20B is a bar graph showing functional silencing of genes in 
primary T cells with pLL3.7 CD25. CD8* TCR transgenic T cells were infected and 
activated as in A. and then cultured for 48 hours in the presence of increasing 

25 concentrations of IL-2. Proliferation was assessed by 3 H-thymidine incorporation. 
[0053] Figure 21 A shows flow cytometric analysis of expression of GFP from 
pLL3.7 CD8 infection in the AK7 ES cell line. AK7 ES cells were infected with 
pLL3.7 CD8 and sorted for GFP expression (green line) and compared with 
uninfected control (purple peak). 

30 [0054] Figure 2 IB shows fluorescent imaging of paws of ES cell-derived mice. 
The paws of control and pLL3.7 CD8 ES chimeric mice were imaged with standard 
epifluorescence for expression of EGFP. 



Page 11 of 171 



WO 2004/022722 



PCT/US2003/028111 



[0055] Figure 21 C shows flow cytometric identification of ES cell-derived 
thymocytes in chimeric mice. Thymocytes from noninfected (purple peak) and 
pLL3.7 CD8 (green line) ES derived mice were harvested and analyzed for GFP 
expression. 

5 [0056] Figure 2 ID is a photograph showing expression of CD4 and CD8 in the 
thymus and spleen of ES cell-derived mice. Thymocytes and splenocytes from week 
old control and CD 8 RNAi (pLL 3.7 CD8) ES cell-derived mice were harvested and 
stained for CD4 and CD8 expression. 

[0057] Figure 22 A shows flow cytometric analysis of EGFP expression in cells 
10 infected with an EGFP-expressing lentiviral vector in which the promoter and EGFP 
coding sequences are floxed. The solid purple peaks represent uninfected cells. The 
population of cells expressing EGFP is shown with a green line. 
[0058] Figure 22B shows flow cytometric analysis of EGFP expression in cells 
infected with an EGFP-expressing lentiviral vector 10 days after induction of Cre 
15 expression. The solid purple peaks represent uninfected cells. The population of 
cells expressing EGFP is shown with a green line. 

[0059] Figure 22C shows a direct flow cytometric comparison between pLL3.7 

infected D7 cells before (green line) and after (pink line) Cre delivery. 

[0060] Figure 23 shows flow cytometric analysis of CD 8 expression in T cells 

20 transfected with transfer plasmids that direct expression of either an shRNA targeted 
to CD 8 or an irrelevant stem-loop sequence, demonstrating silencing of CD 8 by the 
CD8 shRNA. GFP expression is on the x-axis, and CD8 expression is on the y-axis. 
The upper panel shows lack of GFP expression in untransfected cells. The middle 
panel shows CD8 expression in GFP* cells transfected with a transfer plasmid 

25 targeted to an unrelated sequence. The lower panel shows reduced CD 8 expression in 
GFP + cells transfected with a transfer plasmid targeted to CD8. 
[0061] Figure 24 shows flow cytometric analysis of expression of transfected 
human CD8 in wild type ES cells or ES cells infected with a mouse CD8 shRNA 
virus, demonstrating that the mouse CD 8 shRNA specifically silences mouse CD 8 

3 0 and not human CD8 . 
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[0062] Figure 25 is a Northern blot showing that higher expression levels of CDS 
shRNA in cells that did (left) versus cells that did not (right) exhibit silencing of CD8 
following infection with a mouse CD8 shRNA virus. 

5 Definitions 

[0063] The term defective as used herein refers to a nucleic acid that is not 
functional with regard to either (i) encoding its gene product or (ii) serving as a 
signaling sequence. For example, a defective env gene sequence does not encode a 
functional Env protein; a defective packaging signal will not facilitate the packaging 

10 of a nucleic acid molecule that includes the defective signal. A nucleic acid may be 
defective for some but not all of its functions. For example, a defective LTR may fail 
to promote transcription of downstream sequences while still retaining the ability to 
direct integration. Nucleic acid sequences may be made defective by any means 
known in the art, including by mutagenesis, by the deletion of some or all of the 

15 sequence, by inserting a heterologous sequence into the nucleic acid sequence, by 

placing the sequence out-of-frame, or by otherwise blocking the sequence. Defective 
sequences may also occur naturally, i.e., without human intervention, such as by 
mutation, and may be isolated from viruses in which they arise. Proteins that are 
encoded by a defective nucleic acid and are therefore not functional may be referred 

20 to as defective proteins. It is to be understood that the term "defective" is relative. In 
other words, the function need not be completely eliminated but is typically 
substantially reduced relative to the comparable wild type function. Generally, a 
defective sequence exhibits less than approximately 10% of the function of the 
comparable wild type sequence, preferably less than approximately 5% of the 

25 function of the comparable wild type sequence, yet more preferably less than 

approximately 2%, less than approximately 1%, less than approximately 0.5%, or 
approximately 0%, i.e., below the limits of detection. 

[0064] The terms deleted or deletion are used herein in accordance with their 
standard usage in the art, i.e., meaning either total removal of the specified segment or 
30 the removal of a sufficient portion of the specified segment to render the segment 
inoperative or nonfunctional with respect to at least one of its functions. 
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[0065] The term heterologous as used herein in reference to a nucleic acid, refers 
broadly to a first nucleic acid that is inserted into a second nucleic acid such as a 
plasmid or vector. In particular, the term refers to a nucleic acid that is not naturally 
present in the wild type version of a virus-based vector or plasmid that is used to 
5 deliver the sequence into a cell. The term also refers to a nucleic acid that is 

introduced into a cell, tissue, organism, etc., by artificial means including, but not 
limited to, transfection, transformation, or infection with a viral vector. Generally the 
nucleic acid is either not naturally found in the cell, tissue, or organism or, if naturally 
found therein, its expression is altered by introduction of the additional copy of the 
10 nucleic acid (e.g., if the introduced copy is under the control of a different promoter 
than the naturally occurring copy). The term is also used to refer to a protein encoded 
by such a nucleic acid sequence. If a heterologous sequence is introduced into a cell 
or organism, the sequence is considered heterologous to the progeny of such a cell or 
organism. 

1 5 [0066] The term hybridize, as used herein, refers to the interaction between two 
complementary nucleic acid sequences. The phrase hybridizes under high stringency 
conditions describes an interaction that is sufficiently stable that it is maintained under 
art-recognized high stringency conditions. Guidance for performing hybridization 
reactions can be found, for example, in Current Protocols in Molecular Biology, John 

20 Wiley & Sons, N.Y., 6.3. 1-6.3.6, 1989, and more recent updated editions, all of which 
are incorporated by reference. See also Sambrook, Russell, and Sambrook, Molecular 
Cloning: A Laboratory Manual, 3 rd ed., Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, 2001. Aqueous and nonaqueous methods are described in that 
reference and either can be used. Typically, for nucleic acid sequences over 

25 approximately 50-100 nucleotides in length, various levels of stringency are defined, 
such as low stringency (e.g., 6X sodium chloride/sodium citrate (SSC) at about 45°C, 
followed by two washes in 0.2X SSC, 0.1% SDS at least at 50°C (the temperature of 
the washes can be increased to 55°C for medium-low stringency conditions)); 
medium stringency (e.g., 6X SSC at about 45°C, followed by one or more washes in 

30 0.2X SSC, 0.1% SDS at 60°C); high stringency (e.g., 6X SSC at about 45°C, followed 
by one or more washes in 0.2X SSC, 0.1% SDS at 65°C); and very high stringency 
(e.g., 0.5 M sodium phosphate, 0.1% SDS at 65°C, followed by one or more washes at 
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0.2X SSC, 1% SDS at 65°C.) Hybridization under high stringency conditions only 
occurs between sequences with a very high degree of complementarity. One of 
ordinary skill in the art will recognize that the parameters for different degrees of 
stringency will generally differ based various factors such as the length of the 
5 hybridizing sequences, whether they contain RNA or DNA, etc. For example, 
appropriate temperatures for high, medium, or low stringency hybridization will 
generally be lower for shorter sequences such as oligonucleotides than for longer 
sequences. 

[0067] Infectious,, as used herein in reference to a recombinant virus or viral 

10 particle, indicates that the virus or viral particle is able to enter cells in a manner 

substantially similar or identical to that of a wild type virus and to perform at least one 
of the functions associated with infection by a wild type virus, e.g., release of the viral 
genome in the host cell cytoplasm, entry of the viral genome into the nucleus, reverse 
transcription and integration of the viral genome into the host cell's DNA. It is not 

15 intended to indicate that the virus or viral particle is capable of undergoing replication 
or of completing the viral life cycle. The terms "viral particle" and "virus" are 
frequently used interchangeably herein. For example, the phrase "production of 
virus" may refer to production of viral particles and is not intended to indicate that 
wild type or replication competent virus is produced. 

20 [0068] Isolated, as used herein, means 1) separated from at least some of the 

components with which it is usually associated in nature; 2) prepared or purified by a 
process that involves the hand of man; and/or 3) not occurring in nature. 
[0069] Operably linked, as used herein, refers to a relationship between two 
nucleic acid sequences wherein the expression of one of the nucleic acid sequences is 

25 controlled by, regulated by, modulated by, etc., the other nucleic acid sequence. For 
example, the transcription of a nucleic acid sequence is directed by an operably linked 
promoter sequence; post-transcriptional processing of a nucleic acid is directed by an 
operably linked processing sequence; the translation of a nucleic acid sequence is 
directed by an operably linked translational regulatory sequence; the transport or 

30 localization of a nucleic acid or polypeptide is directed by an operably linked 
transport or localization sequence; and the post-translational processing of a 
polypeptide is directed by an operably linked processing sequence. Preferably a 
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nucleic acid sequence that is operably linked to a second nucleic acid sequence is 
covalently linked, either directly or indirectly, to such a sequence, although any 
effective three-dimensional association is acceptable. 

[0070] Purified, as used herein, means separated from many other compounds or 
5 entities, e.g., compounds or entities with which it normally occurs in nature. A 

compound or entity may be partially purified, substantially purified, or pure, where it 
is pure when it is removed from substantially all other compounds or entities, i.e., is 
preferably at least about 90%, more preferably at least about 91%, 92%, 93%, 94%, 
95%, 96%, 97%, 98%, 99%, or greater than 99% pure. 

10 [0071] The term regulatory sequence is used herein to describe a region of nucleic 
acid sequence that directs, enhances, or inhibits the expression (particularly 
transcription, but in some cases other events such as splicing or other processing, 
translation, etc.) of sequence(s) with which it is operatively linked. The term includes 
promoters, enhancers and other transcriptional control elements. In some 

15 embodiments of the invention, regulatory sequences may direct constitutive 

expression of a nucleotide sequence; in other embodiments, regulatory sequences may 
direct tissue-specific and/or inducible expression. For instance, non-limiting 
examples of tissue-specific promoters appropriate for use in mammalian cells include 
lymphoid-specific promoters (see, for example, Calame et al., Adv. Immunol. 43:235, 

20 1988) such as promoters of T cell receptors (see, e.g., Winoto et al., EMBO J. 8:729, 
1989) and immunoglobulins (see, for example, Banerji et al., Cell 33:729, 1983; 
Queen et al., Cell 33:741, 1983), and neuron-specific promoters {e.g., the 
neurofilament promoter; Byrne et al., Proc. Natl. Acad. Set USA 86:5473, 1989). 
Developmentally-regulated promoters are also encompassed, including, for example, 

25 the murine hox promoters (Kessel et al., Science 249:374, 1990) and the a-fetoprotein 
promoter (Campes et al., Genes Dev. 3:537, 1989). In some embodiments of the 
invention regulatory sequences may direct expression of a nucleotide sequence only 
in cells that have been infected with an infectious agent. For example, the regulatory 
sequence may comprise a promoter and/or enhancer such as a virus-specific promoter 

30 or enhancer that is recognized by a viral protein, e.g., a viral polymerase, transcription 
factor, etc. 
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[0072] A short, interfering RNA (siRNA) comprises an RNA duplex that is 
approximately 19 basepairs long and optionally further comprises one or two single- 
stranded overhangs or loops. An siRNA may be formed from two RNA strands that 
hybridize together, or may alternatively be generated from a single RNA strand that 
5 includes a self-hybridizing portion. When siRNAs include one or more free strand 
ends, it is generally preferred that free 5' ends have phosphate groups, and free 3' 
ends have hydroxyl groups. siRNAs include a portion that hybridizes with a target 
transcript. In certain preferred embodiments of the invention, one strand of the 
siRNA is precisely complementary with a region of the target transcript, meaning that 

10 the siRNA hybridizes to the target transcript without a single mismatch. In other 
embodiments of the invention one or more mismatches between the siRNA and the 
targeted portion of the target transcript may exist. In most embodiments of the 
invention in which perfect complementarity is not achieved, it is generally preferred 
that any mismatches be located at or near the siRNA termini. 

15 [0073] The term short hairpin RNA refers to an RNA molecule comprising at least 
two complementary portions hybridized or capable of hybridizing to form a double- 
stranded structure sufficiently long to mediate RNAi (typically at least 19 base pairs 
in length), and at least one single-stranded portion, typically between approximately 1 
and 10 nucleotides in length that forms a loop. As described further below, shRNAs 

20 are thought to be processed into siRNAs by the conserved cellular RNAi machinery. 
Thus shRNAs are precursors of siRNAs and are similarly capable of inhibiting 
expression of a target transcript. 

[0074] The phrase structural protein as used herein refer to the proteins which are 
required for encapsidation (e.g., packaging) of a retroviral or lentiviral genome, and 

25 include Gag, Pol and Env. 

[0075] The term subject, as used herein, refers to any individual to whom a 
lentiviral vector of the invention is delivered for any purpose. Preferred subjects are 
mammals, particularly rodents (e.g., mice and rats), domesticated mammals (e.g., 
dogs, cats, etc.), primates, or humans. 

30 [0076] An siRNA or shRNA or an siRNA or shRNA sequence is considered to be 
targeted to target transcript for the purposes described herein if 1) the stability of the 
target transcript is reduced in the presence of the siRNA or shRNA as compared with 



Page 17 of 171 



WO 2004/022722 



PCT/US2003/028111 



its absence; and/or 2) the siRNA or shRNA shows at least about 90%, more 
preferably at least about 91%, 92% 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% 
precise sequence complementarity with the target transcript for a stretch of at least 
about 17, more preferably at least about 18 or 19 to about 21-23 nucleotides; and/or 3) 
5 one strand of the siRNA or one of the self-complementary portions of the shRNA 
hybridizes to the target transcript under stringent conditions for hybridization of small 
(<50 nucleotide) RNA molecules in vitro and/or under conditions typically found 
within the cytoplasm or nucleus of mammalian cells. Since the effect of targeting a 
transcript is to reduce or inhibit expression of the gene that directs synthesis of the 

1 0 transcript, an siRNA or shRNA targeted to a transcript is also considered to target the 
gene that directs synthesis of the transcript even though the gene itself (i.e., genomic 
DNA) is not thought to interact with the siRNA, shRNA, or components of the 
cellular silencing machinery. Thus as used herein, an siRNA or shRNA that targets a 
gene is understood to target a transcript whose synthesis is directed by the gene. 

15 [0077] The term vector is used herein to refer to a nucleic acid molecule capable 
of mediating entry of, e.g., transferring, transporting, etc., another nucleic acid 
molecule into a cell. The transferred nucleic acid is generally linked to, e.g., inserted 
into, the vector nucleic acid molecule. A vector may include sequences that direct 
autonomous replication, or may include sequences sufficient to allow integration into 

20 host cell DNA. Useful vectors include, for example, plasmids, cosmids, and viral 
vectors. Useful viral vectors include, e.g., replication defective retroviruses, 
adenoviruses, adeno-associated viruses, and lentiviruses. As will be evident to one of 
ordinary skill in the art, viral vectors may include various viral components in 
addition to nucleic acid(s) that mediate entry of the transferred nucleic acid. Thus the 

25 term viral vector may refer either to a virus or viral particle capable of transferring a 
nucleic acid into a cell or to the transferred nucleic acid itself. In particular, the terms 
"lentiviral vector", "lentiviral expression vector" may be used to refer to lentiviral 
transfer plasmids and/or lentiviral particles of the invention as described below. 

30 Detailed Description of Certain Preferred Embodiments of the Invention 

[0078] Retroviruses and retroviral vectors 
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[0079] The retrovirus family consists of a group of viruses with a diploid RNA 
genome that is reverse transcribed during the viral life cycle to yield a double- 
stranded DNA intermediate that stably integrates into the chromosomal DNA of a 
host cell. The integrated DNA intermediate is referred to as a provirus. As used 
5 herein, a provirus is "derived from" a virus or viral particle that delivers the nucleic 
acid from which the proviral DNA is reverse transcribed to the cytoplasm of the cell. 
The retroviral genome and proviral DNA include three genes referred to as gag.pol, 
and env, flanked by two long terminal repeat sequences (LTRs). The 5' and 3 5 LTRs 
contain elements that promote transcription (promoter-enhancer elements) and 

10 polyadenylation of viral RNA. The LTRs also include additional cis-acting sequences 
required for viral replication. In addition, the viral genome includes a packaging 
signal referred to as psi OF) that is necessary for encapsidation (packaging) of the 
retroviral genome. As used herein, a packaging signal or psi sequence is any 
sequence sufficient to direct packaging of a nucleic acid whose sequence comprises 

15 the packaging signal. This includes naturally occurring psi sequences and also 
engineered variants thereof. 

[0080] Briefly, the normal infective cycle begins when the virus attaches to the 
surface of a susceptible cell through interaction with one or more cell surface 
receptors. The virus fuses with the cell membrane, and the viral core is delivered to 
20 the cytoplasm, where the viral matrix and capsid become dismantled, releasing the 
viral genome. Viral reverse transcriptase copies the RNA genome into DNA, which 
moves into the nucleus, where its integration into host cell DNA is catalyzed by the 
viral integrase enzyme. 

[0081] Once integrated into a host genome, viral DNA can remain dormant for 
25 long periods of time. When activated, the viral DNA is transcribed by host cell RNA 
polymerase. The resulting transcript is both a genome for a new virion and a 
transcript from which viral gag and gag-pol polyproteins are synthesized. These 
polyproteins are later processed into the matrix (MA), capsid (CA), and nucleocapsid 
(NC) proteins (in the case of gag), or the matrix, capsid, protease (PR), reverse 
30 transcriptase (RT), and integrase (INT) proteins (in the case of gag-pol). The full- 
length viral RNA transcript also yields transcripts that act as templates for synthesis 
of other viral proteins including envelope glycoproteins and, in the case of 
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lentiviruses, a number of regulatory proteins via various splicing events. Newly made 
Gag and Gag-Pol polyproteins associate with one another, with complete viral 
genomes, and with envelope proteins in the cell membrane so that a new viral particle 
begins to assemble at the membrane. As assembly continues, the structure extrudes 
5 from the cell, thereby acquiring a lipid coat punctuated with envelope glycoproteins. 
Further discussion of the retroviral life cycle and features and descriptions of 
retrovirus classification and taxonomy may be found in Coffin, J., et ah (eds.), 
Retroviruses, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 1997, and in 
Fields, B., et al, Fields 9 Virology, 4 th . ed., Philadelphia: Lippincott Williams and 
10 Wilkins; ISBN: 0781718325, 2001. See also the Web site having URL 

www.ncbi.nlm.nih.gov/ICTVdb/ICTVdB, accessed October 11, 2002, providing a 
classification and information about viruses, of which retroviruses are entry 61 and 
lentiviruses are entry 61.0.6. 

[0082] The ability of retroviruses to enter host cells and to mediate the integration 
15 of heterologous nucleic acid sequences into the cellular genome (transduction) has led 
to their widespread use for in vitro and in vivo transfer and expression of nucleic 
acids, a process often referred to as gene transfer. However, the heterologous nucleic 
acid need not be a gene and need not encode a protein. As used herein, the term 
"gene transfer" refers to transfer of any nucleic acid. A transferred nucleic acid is 
20 "expressed" in a cell if the introduction of the nucleic acid into the cell results, either 
directly or indirectly (such as via reverse transcription, integration, transcription, and, 
in some cases, translation) in the presence of an expression product of the nucleic acid 
(e.g., an RNA transcript and/or a polypeptide) within the cell. 
[0083] Advantages of retroviral vector systems include: (i) efficient entry of 
25 genetic material (the vector genome) into cells; (ii) an efficient process of entry into 
the target cell nucleus; (iii) relatively high levels of gene expression in many settings; 

(iv) minimal pathological effects on target cells in the case of many retroviruses; and 

(v) the potential to target particular cellular subtypes through control of the vector- 
target cell binding and tissue-specific control of gene expression (e.g., using tissue- 

30 specific promoters and/or enhancers). 

[0084] In using a retrovirus for gene transfer, a foreign (not part of the wild type 
virus) sequence (e.g., a gene of interest) may be inserted into the retroviral genome in 
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place of wild type retroviral sequences. When the retrovirus delivers its genome to a 
cell, the foreign sequence is also introduced into the cell and may then be integrated 
into the host's cellular DNA as in the case of a wild type retroviral genome. The 
sequence may then be transcribed by the host cell's transcriptional machinery. If the 
5 sequence includes a coding region, translation of the sequence within the host results 
in expression of the encoded protein by the host cell. The features described above 
have made retroviral vectors particularly attractive for gene therapy although they 
may be used in numerous other applications as described below. 
[0085] In order to improve their safety, many recombinant retroviruses designed 

10 for gene transfer are replication defective, i.e., the genome does not encode functional 
fomis of all the proteins necessary for the complete infective cycle. For example, 
sequences encoding the structural proteins may be mutated or deleted. In particular, 
part or all of the sequence encoding the structural proteins may be replaced by a 
different nucleic acid sequence, i.e., a nucleic acid sequence that is to be introduced 

15 into a target cell. However, the packaging signal remains intact. The nucleic acid 

sequence may include a promoter or its transcription may be under control of the viral 
LTR promoter-enhancer. In order to produce infectious viral particles that can be 
used to deliver the recombinant genome to cells, the required viral proteins are 
provided in trans. This may be accomplished using a variety of approaches as further 

20 described below. 

[0086] Lentiviruses and lentiviral vectors 

[0087] Lentiviruses are a family of retroviruses that differ from the simple 
retroviruses described above in that their genome includes any of a variety of genes in 
addition to Gag, Pol, and Env and may also include various regulatory elements. The 

25 additional genes encode typically include regulatory proteins such as Vif, Vpr, Vpu, 
Tat, Rev, and Nef. (For a discussion of various transcripts present at different times 
during the life cycle of HIV, see, for example, Kim et al., J. Virol. 63:3708, 1989, 
incorporated herein by reference). Further discussion of the lentiviral life cycle and 
features and descriptions of lentivirus classification and taxonomy may be found in 

30 Coffin, J., et al (eds.), Retroviruses, Cold Spring Harbor Laboratory Press, Cold 

Spring Harbor, 1997, and in Fields, B., et al 9 Fields' Virology, 4 th . ed., Philadelphia: , 
Lippincott Williams and Wilkins; ISBN: 0781718325, 2001. 
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[0088] The fact that retroviruses cannot effectively direct integration into the 
genome of nondividing cells has limited their use for introducing genes into many 
important targets such as liver, skeletal muscle, heart, brain, retina, and various cells 
of the hematopoietic system. In contrast, lentiviruses are able to productively infect 
5 and transduce nondividing cells, which has motivated the development of lentiviral 
vectors for gene transfer 20 ' 21 . For example, lentiviruses are able to infect resting T 
cells, dendritic cells, and macrophages. Lentiviral vectors can also transfer genes to 
hematopoietic stem cells with a superior gene transfer efficiency and without 
affecting the repopulating capacity of these cells. Lentiviral vectors can also 

10 transduce liver, skeletal muscle, retina, and neuronal cells. See, e.g., Mautino and 
Morgan, AIDS Patient Care STDS 2002 Jan;16(l): 11-26; Somia, N., et al. J. Virol 
74(9): 4420-4424, 2000; Miyoshi, H., et al., Science 283: 682-686, 1999; US patent 
6,013,516, and references 21 and 24. In addition, lentiviruses display reduced 
susceptibility to developmental silencing relative to simple retroviruses (24). This 

15 feature enables their use for the creation of transgenic animals, which is impractical 
with simple retroviruses because developmental silencing results in low or 
undetectable levels of trausgene expression. 

[0089] As mentioned above, to enhance safety recombinant retroviruses and 
lentiviruses designed for gene transfer are typically replication defective, i.e., the 

20 genome does not encode functional forms of all the proteins necessary for the 

complete infective cycle. The necessary proteins are therefore provided in trans. 
According to one approach, these proteins are provided by a packaging cell that has 
been engineered to produce the proteins. Methods for preparing packaging cell lines 
that express retrovirus proteins are well known in the art (See, e.g., U.S. Pat. No. 

25 4,650,764 to Temin et al 9 U.S. Patent No. 5,955,331 to Danos, et al 9 Sheridan et al, 
Molecular Therapy 2(3):262-275, Sep., 2000). Known packaging cell lines include 
¥2, PA137, and PA12, among others. 

[0090] In the absence of a nucleic acid sequence containing appropriate packaging 
signals, the packaging cell produces empty virions. When a nucleic acid sequence 
30 containing appropriate packaging signals is present within the packaging cell (as may 
be achieved by either stably or transiently transfecting the cell with a construct 
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capable of directing transcription of such a sequence), the sequence can be packaged, 
yielding infectious viral particles. The resulting cell is referred to as a producer cell. 
[0091] In the context of certain embodiments of the present invention, a 
packaging cell will comprise a host cell containing packaging-signal defective nucleic 
5 acid sequence(s) coding for retroviral protein(s). The cell is thus able to produce 
retroviral protein(s) but unable to produce replication-competent infectious virus. 
Packaging cells may be created by transfecting a host cell (e.g., a human 293T cell) 
with one or more nucleic acid sequences encoding such protein(s) according to known 
procedures. Any suitable combination of expression cassettes capable of driving 
10 synthesis of the required proteins is sufficient. Typically the packaging cell line 
contains (i) a modified retroviral genome encoding functional Gag and Gag-Pol 
polyproteins but unable to produce functional envelope protein; and (ii) a sequence 
encoding an envelope protein. 

[0092] The various proteins need not all originate from the same viral species. 

1 5 For example, the Gag and Pol proteins may be derived from any of a wide variety of 
retroviruses or lentiviruses. According to certain preferred embodiments of the 
invention the gag and pol proteins are derived from a lentivirus. According to certain 
embodiments of the invention the gag and pol proteins are derived from HIV. Many 
different types of host cell may be used, provided that the cells are permissive for 

20 transcription from the promoters employed. Suitable host cells include, for example, 
293 cells and derivatives thereof such as, 293. T, 293FT (Invitrogen), 293F, etc., 
NHBT3 cells, etc. In general, any mammalian cell that supports transfection and can 
be grown in sufficient quantities can be used. One of ordinary skill in the art will be 
able to select appropriate host cells. 

25 [0093] Although an envelope derived from the same retrovirus or lentivirus from 
which the other viral proteins are derived can be used (homologous envelope) the use 
of a nonhomologous envelope protein such as the VSV G glycoprotein significantly 
reduces or eliminates the possibility of generating wild-type virus during vector 
manufacturing or after introduction of the vectors into host cells. Thus one useful 

30 class of lentiviral vectors consists of replication-defective, hybrid viral particles made 
from the core proteins and enzymes of a lentivirus and the envelope of a different 
virus such as the vesicular stomatitis virus (VSV) or the Moloney leukemia virus. 
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[0094] Safety considerations prompted development of alternative approaches to 
the production of recombinant retroviral and lentiviral particles capable of infecting 
and transducing cells. According to an approach described in U.S. Patent Number 
6,013,516 and also in references 19 and 20, three different constructs may be used to 
5 produce the recombinant lentiviral particles. Two of these constructs provide 
packaging functions, one containing sequences encoding the core proteins and 
enzymes of the lentivirus and the other containing sequences encoding the envelope 
protein of a different related or unrelated virus. 

[0095] The third construct, referred to herein as a transfer construct, transfer 
10 vector, or transfer plasmid, includes a cloning site for insertion of a heterologous 
nucleic acid (i.e., a sequence not derived from the lentivirus) in addition to the cis- 
acting viral sequences that are necessary for certain aspects of the viral life cycle such 
as encapsidation, reverse transcription, and integration. 

[0096] The three plasmid system, which does not require helper virus, and use of 
15 a heterologous envelope improve the safety of the vector by reducing the likelihood 
that a replication-competent recombinant could be generated. In addition, removal of 
various non-essential cis-acting sequences and the discovery that sequences encoding 
certain viral proteins can be removed while still allowing efficient gene transfer 
further contributes to the safety of this system. These advances are reviewed in 
20 reference 21 and articles listed therein, all of which are incorporated herein by 
reference. 

[0097] The present invention provides new lentiviral transfer plasmids, new 
replication-defective lentiviruses, and new lentiviral expression systems. Maps of 
exemplary lentiviral transfer constructs of the invention are provided in Figures 2 

25 through 9 and corresponding sequences are provided as SEQ ID NOS: 2 through 9. 
However, the invention is not limited to these specific embodiments. Figure 2 shows 
a map of one of the transfer plasmids of the invention in which nucleotide 0 is 
indicated. For purposes of description, nucleotides are numbered in a clockwise 
direction with reference to nucleotide 0, and elements having lower nucleotide 

30 numbers are considered 5' to elements having higher nucleotide numbers. Thus, for 
example, the CMV element is 5 5 to all other elements shown. Note that various 
elements depicted in the maps are not shown to scale. Also, the presence of a 
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particular element on a map is not intended to indicate that the entire element is 
necessarily present. For example, according to certain embodiments of the invention 
a portion of the 5' LTR is deleted. 

[0098] According to certain embodiments of the invention the lentiviral transfer 
5 plasmids are HIV-based lentiviral transfer plasmids. As used herein, a lentiviral 

plasmid is said to be "based on" a particular lentivirus species (e.g., HIV-1) or group 
(e.g., primate lentivirus group) if at least 50% of the lentiviral sequences found in the 
plasmid are derived from a lentivirus of that particular species or group, alternately, if 
the transfer plasmid displays greater identity or homology to a lentivirus of that 

10 particular species or group than to other known lentiviruses. Thus a HIV-based 

lentiviral transfer plasmid is a transfer plasmid in which at least 50% of the lentiviral 
sequences are derived from (i.e., originate from), either HIV-1 or HIV-2 or, 
alternately, if the transfer plasmid displays greater identity or homology to HIV-1 or 
HIV-2 than to other known lentiviruses. In cases where the origin of any given 

15 sequence is unknown, the likelihood that it is derived from a particular lentivirus may 
be determined by sequence comparison using, e.g., programs such as BLAST, 
BLASTNR, or CLUSTALW (or variations thereof) in a comprehensive database such 
as GenBank, Unigene, etc., can be performed using, e.g., default parameters and 
matrices (e.g., BLOSUM substitution matrix). (BLAST is described in Altschul, SF, 

20 et al., Basic local alignment search tool, J. MoL Biol, 215(3): 403-410, 1990, Altchul, 
SF and Gish, W, Methods in Enzymology. 

[0099] The invention provides a lentiviral transfer plasmid whose sequence 
comprises a nucleic acid sequence including (i) a functional packaging signal; (ii) a 
multiple cloning site (MCS); and (iii) at least one additional element selected from the 

25 group consisting of: a second MGS, a second MCS into which a heterologous 

promoter or promoter-enhancer is inserted, an HIV FLAP element, an expression- 
enhancing posttranscriptional regulatory element, a target site for a site-specific 
recombinase, and a self-inactivating (SIN) LTR . It is to be understood that the target 
site for a site-specific recombinase is in addition to any site(s) required for integration 

30 of the lentiviral genome. In other words, in those embodiments of the invention in 
which the additional element is a target site for a site-specific recombinase, the 
lentiviral transfer plasmid will typically also include target sites for the corresponding 
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lentiviral integrase (which normally exist within the LTRs). In particular, the 
invention provides (1) a lentiviral transfer construct as described immediately above 
wherein the additional element is a second MCS; (2) a lentiviral transfer construct as 
described immediately above wherein the additional element is a second MCS in 
5 which a heterologous promoter or promoter-enhancer is inserted; (3) a lentiviral 

transfer construct as described immediately above wherein the additional element is 
an HIV FLAP element; (4) a lentiviral transfer construct as described immediately 
above wherein the additional element is an expression-enhancing posttranscriptional 
regulatory element such as the woodchuck hepatitis virus regulatory element (WRE); 

10 (5) a lentiviral transfer construct as described immediately above wherein the 

additional element is a recombination site for a site-specific recombinase; and (6) a 
lentiviral transfer construct as described immediately above wherein the additional 
element is a SIN LTR. The lentiviral transfer plasmid may also comprise one or 
more heterologous promoters, enhancers, or promoter-enhancers. 

1 5 [00100] The invention further provides lentiviral transfer plasmids containing at 
least two, at least three, at least four, at least five, or all of these additional elements. 
In particular, the invention provides a lentiviral tranfer plasmid comprising a nucleic 
acid sequence that includes (i) a functional packaging signal; (ii) a multiple cloning 
site (MCS); (iii) a second MCS; (iv) a second MCS in which a heterologous promoter 

20 or a heterologous promoter-enhancer is inserted; (v) an HIV FLAP element; (vi) a 
WRE; (vii) two loxP sites; and a self-inactivating (SIN) LTR . The invention also 
encompasses lentiviral transfer plasmids as described above in which a heterologous 
nucleic acid is inserted at a site within an MCS. It will be appreciated that insertion of 
such a sequence separates the MCS into two parts. 

25 [00101] According to preferred embodiments of the invention the transfer plasmid 
includes the cis-acting sequence elements required to support reverse transcription of 
a lentiviral genome and also the cis-acting sequence elements necessary for the 
packaging and integration of a lentiviral genome. These sequences typically include 
the Psi Q¥) packaging sequence, reverse transcription signals, integration signals, 

30 promoter or promoter/enhancer, polyadenylation sequence, tRNA binding site, and 
origin for second strand DNA synthesis. According to certain embodiments of the 
invention the transfer plasmid contains a Rev Response Element (RRE) such as that 
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located at positions 7622-8459 in the HIV NL4-3 genome (Genbank accession 
number AF003887). Of course RREs from other strains of HIV can also be used. 
Such sequences are readily available from Genbank or from the database having URL 
hiv-web.lanl.gov/content/index. According to certain embodiments of the invention 
5 the transfer plasmid contains a 5 f HIV R-U5-del gag element such as that located at 
positions 454-1 126 in the HIV NL4-3 genome. According to preferred embodiments 
of the invention the transfer plasmid contains a sequence encoding a selectable marker 
such as the ampicillin resistance gene (Amp R ) and an origin of replication that allows 
the plasmid to replicate within bacterial cells, such as the pUC origin. Various 
10 features and elements mentioned above (and others) are more fully described in the 
following sections. 

[00102] Lentiviral genome sequences. The lentiviral transfer plasmids may include 
lentiviral sequences derived from any of a wide variety of lentiviruses including, but 
not limited to, primate lentivirus group viruses such as human immunodeficiency 

15 viruses HIV-1 and HIV-2 or simian immunodeficiency virus (SIV); feline lentivirus 
group viruses such as feline immunodeficiency virus (FIV); ovine/caprine 
immunodeficieny group viruses such as caprine arthritis encephalitis virus (CAEV); 
bovine immunodeficiency-like virus (BIV); equine lentivirus group viruses such as 
equine infectious anemia virus; and visna/maedi virus. It will be appreciated that each 

20 of these viruses exists in multiple variants or strains. 

[00103] According to certain preferred embodiments of the invention most or all of 
the lentiviral sequences are derived from HIV-1. For example, according to certain 
embodiments of the invention the lentiviral backbone of the transfer plasmids is 
derived from an HIV-1 -based transfer plasmid such as that described in reference 29 

25 or derivatives thereof such as those described in reference 24. However, it is to be 
understood that many different sources of lentiviral sequences can be used, and 
numerous substitutions and alterations in certain of the lentiviral sequences may be 
accommodated without impairing the ability of the transfer plasmid to perform the 
functions described herein, and such variations are within the scope of the invention. 

30 The ability of any particular lentiviral transfer plasmid to transfer nucleic acids and/or 
to generate a lentiviral particle capable of infecting and transducing cells in the 
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presence of the required viral proteins may readily be tested by methods known in the 
art, some of which will be evident from the Examples. 

[00104] Unique restriction sites and multiple cloning sites. The invention provides 
new lentiviral transfer plasmids incorporating sites for a variety of different restriction 
5 enzymes. In particular, the invention provides lentiviral transfer constructs including 
one or more multiple cloning sites (MCS), e.g., one MCS or two MCSs. As is well 
known in the art, a multiple cloning site, also referred to as a polylinker, or 
polycloning site, is a cluster of cloning sites such that many restriction enzymes 
operate within the site. A cloning site as used herein is a known sequence, preferably 

10 the only one on the plasmid, (i.e., it is a unique sequence on the plasmid) upon which 
a restriction enzyme operates to linearize or cut the plasmid. Restriction sites for 
numerous restriction enzymes are known in the art and are listed, for example, in the 
catalogs of various manufacturers such as New England Biolabs, Promega, Beoringer- 
Ingelheim, etc. For purposes of the present invention a restriction site is unique if it is 

15 recognized as such in the art or, alternately, if the enzyme displays at least a 5-fold 
greater likelihood of cutting at the unique site than at any other site in the plasmid 
under standard digestion conditions. 

[00105] Typically an MCS is less than approximately 100 nucleotides in length 
(measured from the most 5' nucleotide in the most 5' restriction site to the most 3' 

20 nucleotide in the most 3 5 restriction site, and including both of these nucleotides) and 
contains at least 4 unique restriction sites. According to certain embodiments of the 
invention an MCS is less than approximately 100 nucleotides in length. According to 
certain embodiments of the invention an MCS is less than approximately 75 
nucleotides in length. According to certain embodiments of the invention an MCS is 

25 less than approximately 50 nucleotides in length. According to certain embodiments 
of the invention the transfer plasmid comprises an MCS containing at least 5 unique 
restriction sites. According to other embodiments of the invention the transfer 
plasmid comprises an MCS containing at least 6 unique restriction sites. According to 
yet other embodiments of the invention the transfer plasmid comprises an MCS 

30 containing at least 7 unique restriction sites. According to yet other embodiments of 
the invention the transfer plasmid comprises an MCS containing at least 8 unique 
restriction sites. According to yet other embodiments of the invention the transfer 
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plasmid comprises an MCS containing at least 9 unique restriction sites. According to 
yet other embodiments of the invention the transfer plasmid comprises at least two 
MCSs, each of which contains at least 7 unique restriction sites. The invention 
provides a lentiviral transfer plasmid containing an MCS that includes a site for a 
5 restriction enzyme that leaves a blunt end after cutting. The invention further 

provides a lentiviral transfer plasmid containing an MCS that includes a restriction 
site that has an 8 bp recognition sequence. 

[00106] The invention provides a lentiviral transfer plasmid having unique 
restriction sites for at least 4 enzymes selected from the group consisting of NotI, 

10 Apal, Xhol, Xbal, Hpal, Nhel, Pad, Nsil, SphI, Sma/Xma, AccI, BamHI, and SphL 
The invention further provides a lentiviral transfer plasmid having unique restriction 
sites for at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 1 1, at 
least 12, or at least 13 enzymes selected from the group consisting of NotI, Apal, 
Xhol, Xbal, Hpal, Nhel, Pad, Nsil, SphI, Sma/Xma, AccI, BamHI, and SphL The 

15 invention further provides collections of two or more of any of the lentiviral transfer 
plasmids described above. According to certain embodiments of the invention any of 
the lentiviral transfer plasmids described above are HIV-based transfer plasmids. 
[00107] HIV FLAP element According to certain embodiments of the invention 
the transfer plasmid includes an HIV FLAP element. This sequence contains 

20 structural elements associated with the process of reverse transcription and 

encompasses the central polypurine tract and central termination sequences (cPPT 
and CTS). As described in Zennou, et al, Cell, 101, 173, (2000), during HIV-1 
reverse transcription, central initiation of the plus-strand DNA at the central 
polypurine tract (cPPT) and central termination at the central termination sequence 

25 (CTS) lead to the formation of a three-stranded DNA structure: the HIV-1 central 

DNA flap. While not wishing to be bound by any theory, the DNA flap may act as a 
cis-active determinant of lentiviral genome nuclear import and/or may increase the 
titer of the virus. 

[00108] Expression-stimulating posttranscriptional regulatory element. The 
30 invention provides lentiviral transfer plasmids comprising any of a variety of 

posttranscriptional regulatory elements characterized in that their presence within a 
transcript increases expression of the heterologous nucleic acid at the protein level. 
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According to certain embodiments of the invention the posttranscriptional regulatory 
element is the woodchuck hepatitis virus regulatory element (WRE) as described in 
Zufferey, R., et al, J. Virol, 73, 2886, 1999. Other posttranscriptional processing 
elements that may be used include the posttranscriptional processing element present 
5 within the genome of various viruses such as that present within the thymidine kinase 
gene ofherpes simplex virus (Liu, X. ? and J. E. Mertz. Genes Dev. 9:1766-1780, 
1995), and the posttranscriptional regulatory element (PRE) present in hepatitis B 
virus (HBV) (Huang, Z. M., and T. S. Yen, Mol Cell Biol 5:3864-3869, 1995). 
According to the invention the posttranscriptional regulatory element is positioned so 

10 that a heterologous nucleic acid inserted into the transfer plasmid in the 5' directly 
from the element will result in production of a transcript that includes the 
posttranscriptional regulatory element at the 3 5 end. Figure 2 shows an example of a 
transfer plasmid incorporating the WRE downstream of sites for insertion of one or 
more heterologous nucleic acid sequences. Figure 6 shows an example of a transfer 

15 plasmid in which a heteologous nucleic acid encoding EGFP has been inserted in the 
5' direction from the WRE and the ubiqutin C (UbC) promoter has been inserted 
upstream of the sequence encoding EGFP. This configuration results in synthesis of a 
transcript whose 5 5 portion comprises EGFP coding sequences and whose 3 3 portion 
comprises the WRE sequence. 

20 [00109] Long terminal repeats. According to certain embodiments of the invention 
the transfer plasmid includes a self-inactivating (SIN) LTR (29). As is known in the 
art, during the retroviral life cycle, the U3 region of the 3' LTR is duplicated to form 
the corresponding region of the 5' LTR in the course of reverse transcription and viral 
DNA synthesis. Creation of a SIN LTR is achieved by inactivating the U3 region of 

25 the 3 ' LTR (preferably by deletion of a portion thereof as described in reference 29). 
The alteration is transferred to the 5' LTR after reverse transcription, thus eliminating 
the transcriptional unit of the LTRs in the pro virus, which should prevent 
mobilization by replication competent virus. An additional safety enhancement is 
provided by replacing the U3 region of the 5 5 LTR with a heterologous promoter to 

30 drive transcription of the viral genome during production of viral particles. 

Appropriate promoters include, e.g., the CMV promoter. Preferred promoters are able 
to drive high levels of transcription in a Tat-independent manner. This replacement 
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further reduces the possibility of recombination to generate replication-competent 
virus because there is no complete U3 sequence in the virus production system. Thus 
in certain embodiments of the invention the transfer plasmid includes a self- 
inactivating (SIN) 3' LTR. In certain embodiments of the invention the transfer 
5 plasmid includes a 5' LTR in which the U3 region is replaced with a heterologous 
promoter. The heterologous promoter drives transcription during transient 
transfection but after reverse transcription it gets replaced by a copy of U3 from the 3' 
LTR, which in the case of a SIN LTR contains a deletion that makes it unable to drive 
transcription. Thus all transcription is driven by the internal promoter after 
10 integration. 

[00110] According to certain embodiments of the invention one or both LTRs 
contain sequences that can be used to introduce insulator sequences into the vectors. 
In general, insulators are elements that can help to preserve the independent function 
of genes or transcription units embedded in a genome or genetic context in which 

15 their expression may otherwise be influenced by regulatory signals within the genome 
or genetic context. See, for example, Burgess-Beusse B, et al., Proc. Natl. Acad. Sci. 
published August 1, 2002, 10.1073/pnas. 162342499 and Zhan HC, et al., Hum Genet, 
Nov;109(5):471-8, 2001. In the context of the present invention, insulators "protect" 
the lentivirus-expressed sequences from integration site effects, which are mediated 

20 by cis-acting elements present in genomic DNA, and lead to deregulated expression 
of transferred sequences. The invention provides transfer plasmids in which an 
insulator sequence is inserted into one or both LTRs. 

[00111] Heterologous promoters and promoter/enhancers. Any of a wide variety 
of heterologous promoter and promoter/enhancer elements may be included in the 

25 transfer plasmids and used to direct transcription of a heterologous nucleic acid 

sequence in cells infected with the recombinant lentiviral particles of the invention or 
cells into which the transfer plasmids of the invention have been introduced, e.g., by 
transfection. According to certain embodiments of the invention the transfer plasmids 
and lentiviral particles include a single heterologous promoter. In other embodiments 

30 two or more heterologous promoters are included. The promoters may be in the same 
or in opposite orientation. 
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[00112] One of ordinary skill in the art will readily be able to selet appropriate 
promoters depending upon the particular application. For example, sometimes it will 
be desirable to achieve constitutive, non-tissue specific, high level expression of a 
heterologous nucleic acid sequence. For such purposes viral promoters or 
5 promoter/enhancers such as the SV40 promoter, CMV promoter or 

promoter/enhancer, etc., may be employed. Mammalian promoters such as the beta- 
actin promoter, ubiquitin C promoter, elongation factor la promoter, tubulin 
promoter, etc., may also be used. If the plasmids are to be used in non-mammalian 
cells, appropriate promoters for such cells should be selected. 

10 [00113] It may be desirable to achieve cell type specific or tissue-specific 

expression of a heterologous nucleic acid sequence (e.g., to express a particular 
heterologous nucleic acid in only a subset of cell types or tissues or during specific 
stages of development), tissue-specific promoters may be used. For example, it may 
be desirable to achieve conditional expression in the case of transgenic animals or for 

15 therapeutic applications, including gene therapy. As used herein, the term "tissue 
specific promoter" refers to a regulatory element (e.g., promoter, promoter/enhancer 
or portion thereof) that preferentially directs transcription in only a subset of cell or 
tissue types, or during discrete stages in the development of a cell, tissue, or organism. 
A tissue specific promoter may direct transcription in only a single cell type or in 

20 multiple cell types (e.g., two to several different cell types). Numerous tissue-specific 
promoters are known, and one of ordinary skill in the art will readily be able to 
identify tissue specific promoters (or to determine whether any particular promoter is 
a tissue specific promoter) from the literature or by performing experiments such as 
Northern blots, immunoblots, etc. in which expression of either an endogenous gene 

25 or a reporter gene operably linked to the promoter is compared in different cell or 
tissue types). For example, the nestin, neural specific enolase, NeuN, and GFAP 
promoters direct transcription in various neural or glial lineage cells; the keratin 5 
promoter directs transcription in keratinocytes; the MyoD promoter directs 
transcription in skeletal muscle cells; the insulin promoter directs transcription in 

30 pancreatic beta cells; the CYP450 3A4 promoter directs transcription in hepatocytes! 
The invention therefore provides lentiviral transfer plasmids as described above 
comprising a tissue-specific promoter and methods of using the transfer plasmids and 
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lentiviral particles derived therefrom to achieve cell type or tissue specific expression. 
Preferred promoters are active in mammalian cells. According to certain 
embodiments of the invention the tissue-specific promoter is specific for brain (e.g., 
neurons), liver (e.g., hepatocytes), pancreas, skeletal muscle (e.g., myocytes), immune 
5 system cells (e.g., T cells, B cells, macrophages), heart (e.g., cardiac myocytes), 
retina, skin (e.g., keratinocytes), bone (e.g., osteoblasts or osteoclasts), etc. 
[00114] It may be desirable to achieve conditional expression of a heterologous 
nucleic acid sequence (e.g., to control expression of a particular heterologous nucleic 
acid by subjecting a cell, tissue, organism, etc., to a treatment or condition that causes 

10 the heterologous nucleic acid to be expressed or that causes an increase or decrease in 
expression of the heterologous nucleic acid), for which purpose a variety of inducible 
promoters and systems. In particular, it may be desirable to achieve conditional 
expression in the case of transgenic animals or for therapeutic applications, including 
gene therapy. See, e.g., Haviv YS and Curiel DT, A dv Drug Deliv Rev, 53(2): 135-54, 

15 2001, describing approaches for achieving conditional gene expression in cancer cells. 
As used herein, "conditional expression" may refer to any type of conditional 
expression including, but not limited to: inducible expression; repressible expression; 
expression in cells or tissues having a particular physiological, biological, or disease 
state, etc. This definition is not intended to exclude cell type or tissue-specific 

20 expression, since the type of cell or tissue may also be considered a condition. 

[00115] One approach to achieving conditional expression involves the use of 
inducible promoters. As used herein, the term "inducible promoter" refers to a 
regulatory element (e.g., a promoter, promoter/enhancer or portion thereof) whose 
transcriptional activity may be regulated by exposing a cell or tissue containing a 

25 nucleic acid sequence operably linked to the promoter to a treatment or condition that 
alters the transcriptional activity of the promoter, resulting in increased transcription 
of the nucleic acid sequence. For convenience, as used herein, the term "inducible 
promoter" also includes repressible promoters, i.e., promoters whose transcriptional 
activity may be regulated by exposing a cell or tissue containing a nucleic acid 

30 sequence operably linked to the promoter to a treatment or condition that alters the 
transcriptional activity of the promoter, resulting in decreased transcription of the 
nucleic acid sequence. Preferred inducible promoters are active in mammalian cells. 
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Inducible promoters include, but are not limited to steroid-inducible promoters such 
as the promoters for the genes encoding the glucocorticoid or estrogen receptors 
(inducible by treatment with the corresponding hormone), metallothionine promoter 
(inducible by treatment with various heavy metals), MX-1 promoter (inducible by 
5 interferon), etc. The invention therefore provides lentiviral transfer plasmids as 
described above comprising a tissue-specific promoter and methods of using the 
transfer plasmids and lentiviral particles derived therefrom to achieve cell type or 
tissue specific expression. 

[00116] Another approach to achieving conditional expression involves use of 

10 binary transgenic systems, in which gene expression is controlled by the interaction of 
two components: a "target" transgene and an "effector" transgene, whose product acts 
on the target transgene. See, e.g., Lewandoski, M., Nature Reviews Genetics 2, 743- 
755 (2001) and articles referenced therein, all of which are incorporated herein by 
reference, reviewing methods for achieving conditional expression in mice. In 

15 general, binary transgenic systems fall into two categories. In the first type of system, 
the effector transactivates transcription of the target transgene. For example, in the 
tetracycline-dependent regulatory systems (Gossen, M. & Bujard, H, Proc. Natl Acad. 
Sci. USA 89, 5547-5551 (1992), the effector is a fusion of sequences that encode the 
VP 16 transactivation domain and the Escherichia coli tetracycline repressor (TetR) 

20 protein, which specifically binds both tetracycline and the 19-bp operator sequences 
(tetO) of the tet operon in the target transgene, resulting in its transcription. In the 
original system, the tetracycline-controlled transactivator (tTA) cannot bind DNA 
when the inducer is present, while in a modified version, the 'reverse tTA' (rtTA) 
binds DNA only when the inducer is present ('tet-on') (Gossen, M. et aL 9 Science 268, 

25 1766-1769(1995)). The current inducer of choice is doxycycline (Dox). The 
invention therefore provides lentiviral transfer plasmids as described above 
comprising a tetracycline-controlled transactivator or reverse tetracycline-controlled 
transactivator, lentiviral transfer plasmids comprising operator sequences of the tet 
operon to which the tetracycline-controlled transactivator or reverse tetracycline- 

30 controlled transactivator specifically bind, and methods of using the transfer plasmids 
and lentiviral particles derived therefrom to achieve conditional expression, including 
the generation of transgenic animals in which conditional expression is achieved. 
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[00117] In the second type of system, the effector is a site-specific DNA 
recombinase that rearranges the target gene, thereby activating or silencing it. These 
systems are described below. In order to achieve conditional expression in cells or 
tissues having a particular physiological, biological, or disease state, a promoter that 
5 is selectively active in cells or tissue having that particular physiological, biological, 
or disease state may be used. 

[00118] As described further below, one application for the lentiviral transfer 
plasmids and lentiviral expression systems of the invention is to direct transcription of 
RNAs that hybridize or self-hybridize to form siRNAs or shRNAs in cells, e.g., 

10 mammalian cells. For these purposes in certain embodiments of the invention it is 
preferred to use a PolIII promoter such as the U6 or HI promoter. Therefore, the 
invention provides lentiviral transfer plasmids and lentiviral particles optimized for 
siRNA, i.e., lentiviral transfer plasmids and lentiviral particles comprising a PolIII 
promoter, e.g., the U6 or HI promoter. According to certain embodiments of the 

15 invention the PolIII promoter is inducible. It is noted that Pol II promoters can also 
be used to achieve intracellular expression of siRNA or sliRNA (Xia, H., et al, Nat. 
Biotech., 20: 1006-1010, 2002), and the lentiviral vectors described herein may be 
used in this manner. 

[00119] Transfer plasmid size. As described in further detail in Example 1, by 
20 removing certain dispensable sequences the inventors have created lentiviral transfer 
plasmids having reduced size relative to previously known lentiviral transfer 
plasmids, which results in a number of advantages. First, the reduced size of the 
transfer constructs adds to their ease of manipulability. Second, the reduced size adds 
to their flexibility. As is known in the art, there is a limit to the size of retroviral 
25 genomes that can be efficiently packaged. Generally it is preferable to limit the size 
of the transcript for packaging (distance between 5' and 3 ' UTRs) to less than 
approximately 8-10 kB. Thus removal of the dispensable sequences allows the 
insertion of larger heterologous sequence(s) without compromising the ability of the 
resulting genomic transcript to be packaged during the production of lentiviral 
30 particles. As used herein in reference to retroviral and lentiviral vectors, a "genome" 
or "genomic transcript" generally refers to a transcript that contains sufficient 
packaging signals to allow packaging. It does not imply that the transcript need 
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contain all or even most of the genetic information found in a wild type virus. In 
general, the sequence of a genomic transcript will depend on the location of the 
promoter upstream of the packaging sequence and the location of the polyadenylation 
site downstream of the packaging sequence. 
5 [00120] The invention provides a lentiviral transfer plasmid having a length less 
than 10 kilobases (kB). The invention provides a lentiviral transfer plasmid having a 
length less than 9 kB. The invention provides a lentiviral transfer plasmid having a 
length less than 8 kB). The invention provides a lentiviral transfer plasmid having a 
length less than 7 kB). The invention provides a lentiviral transfer plasmid having a 

10 length less than 6.5 kB). The invention provides a lentiviral transfer plasmid having a 
length of approximately 6 kB). Generally, unless otherwise evident from the context, 
the term "approximately" means that the value may deviate by 10% or less from the 
numeral given, and the ranges listed are assumed to include both endpoints. The 
invention further provides collections of lentiviral plasmids having a length less than 

15 10 kB, a length less than 9 kB, a length less than 8 kB, or a length less than 7 kB. 

[00121] In particular, the invention provides a lentiviral transfer plasmid having a 
length less than 8 kB and comprising one or more heterologous nucleic acid 
sequences. According to certain embodiments of the invention the heterologous 
nucleic acid sequence is a promoter or promoter/enhancer such as the CMV promoter, 

20 the CMV promoter/enhancer, or the Ubiquitin C promoter. According to certain 

embodiments of the invention the promoter is the U6 or HI promoter. According to 
certain embodiments of the invention the heterologous nucleic acid sequence is a 
reporter gene, e.g., a gene encoding EGFP or dsRed2. The invention particularly 
provides a lentiviral transfer plasmid having a length of approximately 6.0 kB 

25 comprising at least one MCS, two LoxP sites, an HIV FLAP element, and a WRE. 
[00122] Transfer plasmid sequence information. The inventors have recognized 
that prior art lentiviral vector systems suffered from a dearth of sequence information. 
As will be readily appreciated by one of ordinary skill in the art, regardless of the 
particular nature of a transfer plasmid, it is desirable to have complete and accurate 

30 sequence information. Such information makes it possible, for example, to readily 
determine the identity of all restriction sites, to design primers for amplification of 
particular plasmid sequences or for other purposes such as the introduction of 
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mutations, etc. In addition, the availability of complete sequence information makes 
it possible to identify determinants of plasmid function, e.g., by engineering mutations 
at specific sites and observing the effect on, for example, packaging, integration, 
transcription, etc. Accordingly, the invention provides a fully sequenced lentiviral 
5 transfer plasmid, wherein the sequence is deposited in a publicly accessible database. 
By "fully sequenced" is meant that the complete nucleotide sequence of the plasmid is 
known. By "publicly accessible database" is meant Genbank, or any other database 
that can be accessed by the public without requiring a fee. In particular, the invention 
provides a fully sequenced lentiviral transfer plasmid comprising the sequence set 

10 forth in any of the following SEQ ID NOS: SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID 
NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, or SEQ ID 
NO: 9. In addition, the invention provides a collection of lentiviral transfer plasmids 
including at least two of the plasmids having SEQ ID NOS: SEQ ID NO: 2, SEQ ID 
NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 

15 8, or SEQ ID NO: 9. In addition, the invention provides a lentiviral transfer plasmid 
having a sequence that differs by not more than 100 nucleotides from the sequence set 
forth in SEQ ID NOS: SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, 
SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, or SEQ ID NO: 9. The invention 
further provides a lentiviral transfer plasmid having a sequence that differs by not 

20 more than X nucleotides from the sequence set forth in SEQ ID NOS: SEQ ID NO: 2, 
SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ 
ID NO: 8, or SEQ ID NO: 9, where X represents any number between 1 and 99, 
inclusive. By "a sequence that differs by not more than X nucleotides (where X is 
any number) from the sequence of SEQ ID NO: Y" is meant any sequence that can be 

25 obtained from SEQ ID NO: Y by either inserting, deleting, and/or altering less than X 
nucleotides of SEQ ID NO: Y. 

[00123] Recombination sites for site-specific recombinase. According to certain 
embodiments of the invention the transfer plasmid includes at least one (typically 
two) site(s) for recombination mediated by a site-specific recombinase. Site-specific 
30 recombinases catalyze the introduction or excision of DNA fragments from a longer 
DNA molecule. These enzymes recognize a relatively short, unique nucleic acid 
sequence, which serves for both recognition and recombination. Typically the 
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recombination site is composed of short inverted repeats (6, 7 or 8 base pairs in 
length) and the length of the DNA-binding element is typiclly 1 1-13 bp in length. 
[00124] In general, the transfer plasmids may contain one or more recombination 
sites for any of a wide variety of site-specific recombinases. As mentioned above, it 
5 is to be understood that the target site for a site-specific recombinase is in addition to 
any site(s) required for integration of the lentiviral genome. According to various 
embodiments of the invention the transfer plasmid includes one or more sites for a 
recombinase enzyme selected from the group consisting of Cre, XerD, HP1 and Flp. 
These enzymes and their recombination sites are well known in the art. See, for 

10 example, Sauer, B. & Henderson, N., Nucleic Acids Res. 17, 147-161 

(1989), Gorman, C. and Bullock, C, Curr. Op. BiotechnoL, 11(5): 455-460, 2000, 
O'Gorman, S., Fox, D. T. & Wahl, G. M., Science 251, 1351-1355 (1991) and Kolb, 
A., Cloning Stem Cells, 4(l):65-80, 2002, and U.S. Patent 4,959,317. See also Kuhn, 
R., and Torres, RM, Methods Mol Biol 2002;180:175-204. 

1 5 [00125] These recombinases catalyse a conservative DNA recombination event 
between two 34-bp recognition sites (loxP and FRT, respectively). Placing a 
heterologous nucleic acid sequence operably linked to a promoter element between 
two loxP sites (in which case the sequence is "floxed") allows for controlled 
expression of the heterologous sequence following transfer into a cell. By inducing 

20 expression of Cre within the cell, the heterologous nucleic acid sequence is excised, 
thus preventing further transcription and effectively eliminating expression of the 
sequence. This system has a number of applications including Cre-mediated gene 
activation (in which either heterologous or endogenous genes may be activated, e.g., 
by removal of an inhibitory element or a polyadenylation site), creation of transgenic 

25 animals exhibiting temporal control of Cre expression, cell-lineage analysis in 

transgenic animals, and generation of tissue-specific knockouts or knockdowns in 
transgenic animals. 

[00126] According to certain embodiments of the invention the transfer plasmid 
includes two loxP sites. Furthermore, in preferred embodiments of the invention the 
30 transfer plasmid includes a cloning site, e.g., a unique restriction site, between the two 
loxP sites, which allows the convenient insertion of a heterologous nucleic acid 
sequence. According to certain embodiments of the invention the transfer plasmid 
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includes a MCS between the two loxP sites. According to certain embodiments of the 
invention the two loxP sites are located between an HIV FLAP element and a WRE. 
According to certain embodiments of the invention the plasmid contains a unique 
restriction site between the 3' loxP site and the WRE. 
5 [00127] As described above, positioning the heterologous nucleic acid sequence 
between loxP sites allows for controlled expression of the heterologous sequence 
following transfer into a cell. By inducing Cre expression within the cell, the 
heterologous nucleic acid sequence is excised, thus preventing further transcription 
and effectively eliminating expression of the sequence. Cre expression may be 

10 induced in any of a variety of ways. For example, Cre may be present in the cells 
under control of an inducible promoter, and Cre expression may be induced by 
activating the promoter. Alternately, Cre expression may be induced by introducing 
an expression vector that directs expression of Cre into the cell. Any suitable 
expression vector can be used, including, but not limited to, viral vectors such as 

15 adenoviral vectors. (The phrase "inducing Cre expression" as used herein refers to 
any process that results in an increased level of Cre within a cell.) 
[00128] The invention thus provides a method for achieving controlled expression 
of a heterologous nucleic acid sequence comprising steps of inserting the 
heterologous nucleic acid sequence into a transfer plasmid of the invention between 

20 sites for a recombinase, thereby producing a modified transfer plasmid; introducing 
the modified transfer plasmid or a portion thereof including at least the sites for the 
recombinase and the region between the sites into a cell and; subsequently inducing 
expression of the recombinase within the cell. According to certain embodiments of 
the invention the cell is a mammalian cell. According to certain embodiments of the 

25 invention the recombinase is Cre and the sites for the recombinase are loxP sites. In 
accordance with the invention the transfer plasmid may be introduced into the cell 
using standard techniques such as transfection. Alternately, the transfer plasmid may 
be used to generate a lentiviral particle that includes a lentiviral genome comprising 
the recombinase sites and the region between them. As described elsewhere herein, 

30 the genome integrates into the cell's DNA and directs expression of the heterologous 
nucleic acid sequence. The cell may be used for any of a variety of purposes as 
described in more detail below. 
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[00129] The lentiviral transfer plasmids comprising two loxP sites are useful in any 
applications for which standard vectors comprising two loxP sites can be used. For 
example, selectable markers may be placed between the loxP sites. This allows for 
sequential and repeated targeting of multiple genes to a single cell (or its progeny). 
5 After introduction of a transfer plasmid comprising a floxed selectable marker into a 
cell, stable transfectants may be selected. After isolation of a stable transfectant, the 
marker can be excised by induction of Cre. The marker may then be used to target a 
second gene to the cell or its progeny. Lentiviral particles comprising a lentiviral 
genome derived from the transfer plasmids may be used in the same manner. 

10 [00130] As another example, standard gene-targeting techniques may be used to 
produce a mouse in which an essential region of a gene of interest is floxed, so that 
tissue-specific Cre expression results in the inactivation of this allele. The transfer 
plasmids may be introduced into cells (e.g., ES cells) using pronuclear injection. 
Alternately, the cells may be injected or infected with lentiviral particles comprising a 

1 5 lentiviral genome derived from the transfer plasmid. Tissue -specific Cre expression 
may be achieved by crossing a mouse line with a conditional allele (i.e., a floxed 
nucleic acid sequence) to an effector mouse line that expresses cre in a tissue-specific 
manner, so that progeny are produced in which the conditional allele is inactivated 
only in those tissues or cells that express Cre. Suitable transgenic lines are known in 

20 the art and may be found, for example, in the Cre Transgenic Database at the Web site 
having URL wvvw.mshri.on.ca/nagy/Cre-pub.html. 

[00131] Internal ribosome entry sequence (IRES). The transfer plasmids may also 
include an IRES. IRES elements function as initiators of the efficient translation of 
reading frames. An IRES allows ribosomes to start the translation process anew with 

25 whatever is immediately downstream and regardless of whatever was upstream. In 
particular, an IRES allows for the translation of two different genes on a single 
transcript. For example, an IRES allows the expression of a marker such as EGFP off 
the same transcript as a transgene, which has a number of advantages: (1) The 
transgene is native and does not have any fused open reading frames that might affect 

30 function; (2) Since the EGFP is from the same transcript its levels should be an 

accurate representation of the levels of the upstream transgene. IRES elements are 
known in the art and are further described in Kim, et al., Molecular and Cellular 



Page 40 of 171 



WO 2004/022722 



PCT/US2003/028111 



Biology 12(8):3636-3643 (August 1992) and McBratney, et al., Current Opinion in 
Cell Biology 5:961-965 (1993). 

[00132] Any of a wide variety of sequences of viral, cellular, or synthetic origin 
which mediate internal binding of the ribosomes can be used as an IRES. Examples 
5 include those IRES elements from polio virus Type I, the 5 ! UTR of 

encephalomyocarditis virus (EMV), of Thelier ! s murine encephalomyelitis virus 
(TMEV) of foot and mouth disease virus (FMDV) of bovine enterovirus (BEV) ? of 
coxsackie B virus (CBV), or of human rhino virus (HRV), or the human 
immunoglobulin heavy chain binding protein (BIP) 5'UTR, the Drosophila 
10 antennapediae 5'UTR or the Drosophila ultrabithorax 5 ? UTR, or genetic hybrids or 
fragments from the above-listed sequences. 

[00133] Transfer plasmids incorporating heterologous nucleic acids. The invention 
provides new lentiviral transfer constructs incorporating a variety of heterologous 
nucleic acids (also referred to as heterologous sequences or heterologous nucleic acid 

15 segments), preferably operably linked to a promoter or promoter/enhancer element. 
These sequences may be inserted at any available site within the transfer plasmid 
including, but not limited to, at a restriction site within a MCS. In general, the 
inserted nucleotide sequence may be any nucleotide sequence and may be a naturally 
occurring sequence or variant thereof or an artificial sequence. Heterologous gene 

20 sequences of the present invention may comprise one or more gene sequences that 
already possess one or more regulatory elements such as promoters, initiation 
sequences, processing sequences, etc. Alternatively, such regulatory elements may be 
present within the transfer plasmid prior to insertion of the heterologous sequence, 
[00134] According to certain embodiments of the invention the inserted 

25 heterologous sequence is a reporter gene sequence. A reporter gene sequence, as used 
herein, is any gene sequence which, when expressed, results in the production of a 
protein whose presence or activity can be monitored. Suitable reporter gene 
sequences include, but are not limited to, sequences encoding chemiluminescent or 
fluorescent proteins such as green fluorescent protein (GFP) and variants thereof such 

30 as enhanced green fluorescent protein (EGFP); cyan fluorescent protein; yellow 

fluorescent protein; blue fluorescent protein; dsRed or dsRed2, luciferase, aequorin, 
etc. Many of these markers and their uses are reviewed in van Roessel, P. and Brand, 
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A., Nature Cell Biology, 4(1), El 5-20, 2002, and references therein, all of which are 
incorporated herein by reference. Additional examples of suitable reporter genes 
include the gene for galactokinase, beta-galactosidase, chloramphenicol 
acetyltransferase, beta-lactamase, etc. Alternatively, the reporter gene sequence may 
5 be any gene sequence whose expression produces a gene product which affects cell 
physiology or phenotype. In general, a reporter gene sequence typically encodes a 
protein that is not normally present within a cell into which the transfer plasmid is to 
be introduced. 

[00135] According to certain embodiments of the invention the inserted 
10 heterologous sequence is a selectable marker gene sequence, which term is used 

herein to refer to any gene sequence capable of expressing a protein whose presence 
permits the selective maintenance and/or propagation of a cell which contains it. 
Examples of selectable marker genes include gene sequences capable of conferring 
host resistance to antibiotics (e.g., puromycin, ampicillin, tetracycline, kanamycin, 
15 and the like), or of conferring host resistance to amino acid analogues, or of 
permitting the growth of cells on additional carbon sources or under otherwise 
impermissible culture conditions. A gene sequence may be both a reporter gene and a 
selectable marker gene sequence. In general, preferred reporter or selectable marker 
gene sequences are sufficient to permit the recognition or selection of the plasmid in 
20 normal cells. 

[00136] The heterologous sequence may also comprise the coding sequence of a 
desired product such as a biologically active protein or polypeptide (e.g., a 
therapeutically active protein or polypeptide) and/or an immunogenic or antigenic 
protein or polypeptide. Introduction of the transfer plasmid into a suitable cell thus 

25 results in expression of the protein or polypeptide by the cell. Alternatively, the 
heterologous gene sequence may comprise a nucleic acid segment that provides a 
template for transcription of an antisense RNA, a ribozyme, or, preferably, one or 
more strands of a short interfering RNA (siRNA) or a precursor thereof such as a 
short hairpin RNA (shRNA). As described further below, siRNAs and shRNAs 

30 targeted to cellular transcripts inhibit expression of such transcripts. Introduction of 
the transfer plasmid into a suitable cell thus results in production of the siRNA or 
shRNA, which inhibits expression of the target transcript. 
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[00137] Three and four plasmid lentiviral expression systems. The invention 
further provides a recombinant lentiviral expression system comprising three 
plasmids. The first plasmid is constructed to contain mutations that prevent 
lentivirus-mediated transfer of viral genes. Such mutations may be a deletion of 
5 sequences in the viral env gene, thus preventing the generation of replication- 
competent lentivirus, or may be deletions of certain cis-acting sequence elements at 
the y end of the genome required for viral reverse transcription and integration. Thus 
even if viral genes from this construct are packaged into viral particles, they will not 
be replicated and replication-competent wild-type viruses will not be produced. The 

10 first plasmid (packaging plasmid) comprises a nucleic acid sequence of at least part of 
a lentiviral genome, wherein the vector (i) contains at least one defect in at least one 
gene encoding a lentiviral structural protein, and (ii) lacks a functional packaging 
signal. The second plasmid (Env-coding plasmid) comprises a nucleic acid sequence 
of a virus, wherein the vector (i) expresses a viral envelope protein, and (ii) lacks a 

15 functional packaging signal. The third plasmid may be any of the inventive transfer 
plasmids described above. The first and second plasmids are further described below, 
and schematic diagrams of relevant portions of representative first and second 
plasmids (packaging and Env-coding) are presented in Figure 10A, which is taken 
from reference 21. The third plasmid (not shown) is a transfer plasmid. 

20 [00138] Packaging plasmid. In certain embodiments of the invention the first 

vector is a gag/pol expression vector, i.e., a plasmid capable of directing expression of 
functional forms of a retroviral gag gene product and a retroviral Pol gene product. 
These proteins are necessary for assembly and release of viral particles from cells. 
The first plasmid may also express sequences encoding various accessory lentiviral 

25 proteins including, but not limited to, Vif, Vpr, Vpu, Tat, Rev, and Nef. In particular, 
the first plasmid may express a sequence encoding Rev. In general, the gag and pol 
sequences may be derived from any retrovirus, and the accessory sequences may be 
derived from any lentivirus. According to certain embodiments of the invention the 
gag and pol sequences and any accessory sequences are derived from HIV-1 . It is 

30 noted that the gag, pol, and accessory protein sequences need not be identical to wild 
type versions but instead may contain mutations, deletions, etc., that do not 
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significantly impair the ability of the protein to perform its function in the viral life 
cycle. 

[00139] The first plasmid is preferably constructed to contain mutations that 
exclude retroviral-mediated transfer of viral genes. Such mutations may be a deletion 
5 or mutation of sequences in the viral env gene, thus excluding the possibility of 

generating replication-competent lentivirus. Alternatively, or in addition to, deletion 
or mutation of env, according to certain embodiments of the invention the plasmid 
sequence may contain deletions of certain cis- acting sequence elements at the 3 f end 
of the genome required for viral reverse transcription and integration. Accordingly, 

10 even if viral genes from this construct are packaged into viral particles, they will not 
be replicated and replication-competent wild-type viruses will not be generated. Any 
of a wide variety of packaging plasmids may be used in the three plasmid lenti viral 
expression system of the invention including, but not limited to, those described in 
references 21, 24, 29, and 40. 

15 [00140] Env-coding plasmid. This plasmid directs expression of a viral envelope 
protein and, therefore, comprises a nucleic acid sequence encoding a viral envelope 
protein under the control of a suitable promoter. The promoter can be any promoter 
capable of directing transcription in cells into which the plasmid is to be introduced. 
One of ordinary skill in the art will readily be able to select an appropriate promoter 

20 among, for example, the promoters mentioned above. For example, according to 

i 

certain embodiments of the invention a CMV promoter is used. The Env-coding 
plasmid preferably contains any additional sequences needed for efficient 
transcription, processing, etc., of the env transcript including, but not limited to, a 
polyadenylation signal such as any of those mentioned above. 

25 [00141] The host range of cells that the viral vectors of the present invention can 
infect may be altered (e.g., broadened) by utilizing an envelope gene from a different 
virus. Thus is possible to alter or increase the host range of the vectors of the present 
invention by taking advantage of the ability of the envelope proteins of certain viruses 
to participate in the encapsidation of other viruses. In a preferred embodiment of the 

30 present invention, the G-protein of vesicular-stomatitis virus (VSV-G; see, e.g., Rose 
and Gillione, J. Virol. 39, 519-528 (1981); Rose and Bergmann, Cell 30, 753-762 
(1982)), or a fragment or derivative thereof, is the envelope protein expressed by the 
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second plasmid. VSV-G efficiently forms pseudotyped virions with genome and 
matrix components of other viruses. As used herein, the term "pseudotype" refers to a 
viral particle that contains nucleic acid of one virus but the envelope protein of 
another virus. Li general, VSV-G pseudotyped viruses have a very broad host range, 
5 and may be pelleted to titers of high concentration by ultracentrifugation (e.g., 

according to the method of J. C. Bums, et al., Proc. Natl. Acad. Sci. USA 90, 8033- 
8037 (1993)), while still retaining high levels of infectivity. 
[00142] Additional envelope proteins that may be used in accordance with the 
present invention include, but are not limited to, ecotropic or amphotropic MLV 

10 envelopes, 10A1 envelope, truncated forms of the HIV env, GALV, BAEV, SIV, 

FeLV-B, RD1 14, SSAV, Ebola, Sendai, FPV (Fowl plague virus), and influenza virus 
envelopes. Similarly, genes encoding envelopes from RNA viruses (e.g. RNA virus 
families of Picornaviridae, Calciviridae, Astro viridae, Togaviridae, Flaviviridae, 
Coronaviridae, Paramyxoviridae, Rhabdoviridae, Filoviridae, Orthomyxoviridae, 

15 Bunyaviridae, Arenaviridae, Reoviridae, Birnaviridae, Retro viridae) as well as from 
the DNA viruses (families of Hepadnaviridae, Circo viridae, Parvo viridae, 
Papovaviridae, Adenoviridae, Herpesviridae, Poxviridae, and Iridoviridae) may be 
utilized. Representative examples include FIV, FeLV, RSV, VEE, HFVW, WDSV, 
SFV, Rabies, ALV, BIV, BLV, EBV, CAEV, HTLV, SNV, ChTLV, STLV, MPMV, 

20 SMRV, RAV, FuSV, MH2, AEV, AMV, CT10, EIAV. In addition to the above, 
hybrid envelopes (e.g. envelope comprising regions of more than one of the above), 
may be employed. According to certain embodiments of the invention the envelope 
recognizes a unique cellular receptor (e.g., a receptor found only on a specific cell 
type or in a specific species), while according to certain other embodiments of the 

25 invention the envelope recognizes multiple different receptors. According to certain 
embodiments of the invention the second plasmid encodes a cell or tissue specific 
targeting envelope. Cell or tissue specific targeting may be achieved, for example, by 
incorporating particular sequences within the envelope sequence (e.g., sequences 
encoding ligands for cell or tissue-specific receptors, antibody sequences, etc.). Thus 

30 any of a wide variety of Env-coding plasmids may be used in the three plasmid 
lentiviral expression system of the invention including, but not limited to, those 
described in references 21, 24, 29, and 40. 
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[00143] Variations on the three plasmid system. The invention further provides a 
four plasmid lentiviral expression system comprising a three plasmid lentiviral 
expression system as described herein and a fourth plasmid comprising a nucleic acid 
sequence encoding the Rev protein (in which case the rev gene is generally not 
5 included in the other plasmids. As mentioned above, the presence of Rev increases 
the level of transcription during production of lentiviral particles. It will be 
appreciated that a variety of alternative three or four plasmid systems may be 
employed while maintaining the feature that no sequence of recombination event(s) 
between only two of the three or four plasmids is sufficient to generate replication- 

10 competent virus. For example, either Gag or Pol or any of the accessory proteins may 
be encoded by the plasmid referred to as the Env-coding plasmid. Alternately, Gag, 
Pol, or any of the accessory proteins may be encoded by the transfer plasmid. In 
addition, sequences encoding Rev may be provided on the same plasmid that encodes 
Gag, Pol, or Env. According to certain embodiments of the invention sequences 

15 encoding a functional Tat protein are absent from the plasmids, and sequences 

encoding Rev are provided on a separate plasmid rather than on the same plasmid as 
sequences encoding other viral genes, as described in reference 40. Schematic 
diagrams of relevant portions of representative first and second plasmids (packaging 
and Env-coding) and fourth plasmid encoding Rev are presented in Figure 10B, which 

20 is taken from reference 40. The third plasmid (not shown) is a transfer plasmid. 

[00144] Applications of the lentiviral transfer plasmids and expression systems. 
The lentiviral transfer plasmids and lentiviral expression systems of the invention 
have a wide variety of uses, some of which have been described above. As will be 
evident, the transfer plasmids may be used for any application in which a 

25 conventional expression plasmid is employed. As described in Examples 3 through 6, 
the transfer plasmids of the invention are able to drive expression of heterologous 
genes (e.g., EGFP) when transfected into cells and are also able to drive synthesis of 
shRNA when transfected into cells. 

[00145] The presence of one or more MCSs means that the plasmids may 
30 conveniently be used for insertion and subsequent expression of any heterologous 

sequence. In particular, the transfer plasmids that include an insertion site such as an 
MCS between sites for a recombinase such as loxP may be used for easy assembly of 
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a promoter-site-sequence-site cassette, (where "site" indicates a recombination site for 
arecombinase and "sequence" indicates a heterologous sequence of interest), e.g., a 
promoter-loxP-sequence-loxP site that can then be moved into another vector. It is 
noted that the transfer plasmids can be used to direct expression of a heterologous 
5 nucleic acid in a variety of eukaryotic cells other /than mammalian cells, provided a 
promoter capable of directing transcription in such cells is employed. Thus references 
to "mammalian cells" herein should not be understood to exclude non-mammalian 
cells, as long as an appropriate promoter for transcription in non-mammalian cells is 
provided. 

10 [00146] Introducing plasmids into cells. In general, the plasmids described herein 
may be introduced into cells via conventional transformation or transfection 
techniques. As used herein, the terms "transformation" and "transfection" are 
intended to refer to a variety of art-recognized techniques for introducing foreign 
nucleic acid (e.g., DNA or RNA) into cells, including calcium phosphate or calcium 

15 chloride co-precipitation, DEAE-dextran-mediated transfection, lipofection, injection, 
or electroporation. 

[00147] Production of replication-defective lentiviral particles. In general, the 
transfer plasmids and the three-plasmid recombinant lentiviral expression systems of 
the invention may be used to produce infectious, replication-defective lentiviral 

20 particles according to methods known to those skilled in the art. In the case of the 
recombinant lentiviral expression system of the invention the methods include (i) 
transfecting a lentiviras-permissive cell with the three-plasmid lentiviral expression 
system of the present invention; (ii) producing the lentivirus-derived particles in the 
transfected cell; and (iii) collecting the virus particles from the cell. The step of 

25 transfecting the lentivirus-permissive cell can be carried out according to any suitable 
means known to those skilled in the art. For example, the three-plasmid expression 
system described herein may be used to generate lentivirus-derived retroviral vector 
particles by transient transfection. The plasmids may be introduced into cells by any 
suitable means, including, but not limited to, calcium phosphate or calcium chloride 

30 co-precipitation, DEAE-dextran-mediated transfection, lipofection, injection, or 
electroporation. 
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[00148] The transfer plasmids of the invention may be used to produce infectious, 
replication-defective lentiviral particles in a similar maimer using helper cells that 
express the necessary viral proteins as known in the art and mentioned above. In 
general, the transfer plasmids may be used to produce infectious, replication-defective 
5 lentiviral particles in conjunction with any system using any combination of plasmids 
and/or helper cell lines that provides the appropriate combination of required genes: 
gag, pol, env, and, preferably, rev in cases where transcription occurs from a gag/pol 
expression cassette containing a Rev-response element (or alternately a system that 
supplies the various proteins encoded by these genes). 

10 [00149] Infectious virus particles may be collected using conventional techniques. 
For example, the infectious particles may be collected by cell lysis, or collection of 
the supernatant of the cell culture, as is known in the art. Optionally, the collected 
virus particles may be purified if desired. Suitable purification techniques are well 
known to those skilled in the art. Methods for titering virus particles are also well 

15 known in the art. Further details are provided in the Examples. 

[00150] Producer cell lines. As will be evident, when a host cell permissive for 
production of lentiviral particles is transfected with the plasmids of the three-plasmid 
system, the cell becomes a producer cell, i.e., a cell that produces infectious lentiviral 
particles. Similarly, when a helper cell that produces the necessary viral proteins is 

20 transfected with a transfer plasmid of the invention, the cell becomes a producer cell. 
The invention therefore provides producer cells and corresponding producer cell lines 
and methods for the production of such cells and cell lines. In particular, the 
invention provides a method of creating a producer cell line comprising introducing a 
transfer plasmid of the invention into a host cell; and introducing a packaging plasmid 

25 and an envelope plasmid into the host cell. The invention provides another method of 
creating a producer cell line comprising introducing a transfer plasmid of the 
invention into a helper cell that produces viral proteins necessary for encapsidation of 
a lentiviral genome and subsequent infectivity of a lentiviral particle resulting from 
encapsidation. 

30 [00151] The inclusion of appropriate genetic elements from various papovaviruses 
allows plasmids to be maintained as episomes within mammalian cells. Such 
plasmids are faithfully distributed to daughter cells. In particular, viral elements of 
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various polyomaviruses and papillomaviruses such as BK virus (BKV), bovine 
papilloma virus 1 (BPV-1) and Epstein-Barr virus (EBV), among others, are useful in 
this regard. The invention therefore provides lentiviral transfer plasmids comprising a 
viral element sufficient for stable maintenance of the transfer plasmid as an episome 
5 within mammalian cells. Appropriate genetic elements and their use are described, 
for example, in Van Craenenbroeck, et al. 9 Eur. J. Biochem. 267, 5665-5678 (2000) 
and references therein, all of which are incorporated herein by reference. 
[00152] The invention further provides cell lines comprising the transfer plasmids 
described above, i.e., cell lines in which the transfer plasmids are stably maintained as 
10 episomes. In particular, the invention provides producer cell lines (cell lines that 

produce the proteins needed for production of infectious lentiviral particles) in which 
the transfer plasmids are stably maintained as episomes. According to certain 
embodiments of the invention these cell lines constitutively produce lentiviral 
particles. 

1 5 [00153] According to other embodiments of the invention one or more of the 
necessary viral proteins is under the control of an inducible promoter. Thus the 
invention provides helper cell lines in which the transfer plasmids are stably 
expressed as episomes, wherein at least one viral protein expressed by the cell line is 
under control of an inducible promoter. This allows the cells to be expanded under 

20 conditions that are not permissive for viral production. Once the cells have reached a 
desired density (e.g., confluence), or a desired cell number, etc., the protein whose 
expression is under control of the inducible promoter can be induced, allowing 
production of viral particles to begin. This system offers a number of advantages. In 
particular, since every cell has the required components, titer is increased. In 

25 addition, it avoids the necessity of performing a transfection each time a particular 
virus is desired. Any of a variety of inducible promoters known in the art may be 
used. One of ordinary skill in the art will readily be able to select an appropriate 
inducible promoter and apply appropriate techniques to induce expression therefrom. 
[00154] The invention thus provides a method of producing lentiviral particles 

30 comprising introducing a lentiviral transfer plasmid of the invention, which lentiviral 
transfer plasmid comprises a genetic element (e.g., a viral element) sufficient for 
stable maintenance of the transfer plasmid as an episome in mammalian cells, into a 
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helper cell that produces proteins needed for production of infectious lentiviral 
particles and; culturing the cell for a period sufficient to allow production of lentiviral 
particles. The invention further provides a method of producing lentiviral particles 
comprising introducing a lentiviral transfer plasmid of the invention, which lentiviral 
5 transfer plasmid comprises a genetic element sufficient for stable maintenance of the 
transfer plasmid as an episome in mammalian cells, into a helper cell that expresses a 
protein required for production of lentiviral particles, wherein expression of the 
protein is under control of an inducible promoter; inducing expression of the protein 
required for production of lentiviral particles; and culturing the cell for a period 

1 0 sufficient to allow production of lentiviral particles. 

[00155] Transgenic and knockout animals. The transfer plasmids may be used to 
generate stable transgenic or knockout animals, wherein the transgene is a 
heterologous nucleic acid contained in the transfer plasmid. Transgenic animals may 
be generated through standard (non- viral) means such as pronuclear injection of the 

1 5 transfer plasmid. In addition, the lentiviral particles may be used to create transgenic 
animals wherein the transgene is a heterologous nucleic acid contained in the 
lentiviral particle. For example, lentiviral particles of the invention may be injected 
into the perivitelline space of single-cell embryos, which may then be implanted and 
carried to term. Alternately, the zona pellucida may be removed and the denuded 

20 embryo incubated with lentiviral suspension prior to implantation as described in 
reference 24. This approach offers a more efficient method of creating a variety of 
transgenic animals, e.g., birds, rats, and other mammals. As used herein, a 
"transgenic animal" is a non-human animal, preferably a mammal, more preferably a 
rodent such as a rat or mouse, in which one or more of the cells of the animal includes 

25 a transgene. Other examples of transgenic animals include non-human primates, 
sheep, dogs, cows, goats, chickens, amphibians, and the like. Transgenic animals 
typically carry a gene which has been introduced into the germline of the animal, or 
an ancestor of the animal, at an early (usually one-cell) developmental stage. In 
general, a transgene is heterologous DNA, which preferably is integrated into or 

30 occurs in the genome of the cells of a transgenic animal. Integration of the transgene 
may lead to a deletion of endogenous chromosomal DNA, e.g., by homologous 
recombination, such that the function of an expression product of the DNA is 
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impaired or eliminated. In this case the resulting animal is referred to as a knockdown 
or knockout animal. Note that transgene sequences may include endogenous 
sequences but typically also include additional sequences that do not naturally occur 
in the animal. 

5 [00156] As described in Example 7, the inventors have generated transgenic mice 
using a lentiviral particle comprising a heterologous nucleic acid encoding the 
fluorescent protein GFP, which serves as a transgene. The lentiviral particles were 
able to induce expression of GFP within embryonic stem cells (ES cells), and these 
ES cells gave rise to transgenic animals whose cells expressed GFP. These results 
10 demonstrate that heterologous nucleic acids contained in the lentiviral particles of the 
invention are not subject to developmental silencing. 

[00157] Constitutive, conditional, reversible, and tissue-specific expression. The 
transfer plasmids and lentiviral particles of the invention may be used to achieve 
constitutive, conditional, reversible, or tissue-specific expression in cells, tissues, or 

15 organisms, including transgenic animals. The invention provides a method of 

reversibly expressing a transcript in a cell comprising: (i) delivering a lentiviral vector 
to the cell, wherein the lentiviral vector comprises a heterologous nucleic acid, and 
wherein the heterologous nucleic acid is located between sites for a site-specific 
recombinase; and (ii) inducing expression of the site-specific recombinase within the 

20 cell, thereby preventing synthesis of synthesis of the transcript within those cells. 
According to certain embodiments of the invention the cell is a mammalian cell. 
According to certain embodiments of the invention the step of inducing the site- 
specific recombinase comprises introducing a vector encoding the site-specific 
recombinase into the cell. According to other embodiments of the invention a nucleic 

25 acid encoding the site-specific recombinase is operably linked to an inducible 

promoter, and the inducing step comprises inducing the promoter as described above. 
As discussed in more detail in Example 8, the inventors have shown that introduction 
of a lentiviral particle comprising a heterologous nucleic acid encoding the 
fluorescent protein EGFP between loxP sites into cells results in expression of EGFP 

30 within the cells. When the EGFP-expressing cells were subsequently infected with an 
adenovirus containing a nucleic acid encoding Cre, thereby inducing expression of 
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Cre within the cells, expression of EGFP was eliminated in a significant proportion of 
the cells. Thus expression of EGFP was reversible. 

[00158] In addition, the invention provides a variety of methods for achieving 
conditional and/or tissue-specific expression. For example, the invention provides a 
5 method for expressing a transcript in a mammal in a cell type or tissue-specific 

manner comprising: (i) delivering a lentiviral transfer plasmid or lentiviral particle to 
cells of the mammal, wherein the lentiviral transfer plasmid or lentiviral particle 
comprises a heterologous nucleic acid, and wherein the heterologous nucleic acid is 
located between sites for a site-specific recombinase; and (ii) inducing expression of 

10 the site-specific recombinase in a subset of the cells of the mammal, thereby 
preventing synthesis of the transcript within those cells. According to certain 
embodiments of the inventive methods the recombinase is Cre. According to certain 
embodiments of the invention the step of inducing the site-specific recombinase 
comprises introducing a vector encoding the site-specific recombinase into the cell. 

15 According to other embodiments of the invention a nucleic acid encoding the site- 
specific recombinase is operably linked to an inducible promoter, and the inducing 
step comprises inducing the promoter as described above. In certain embodiments of 
the invention the nucleic acid encoding the site-specific recombinase is operably 
linked to a cell type or tissue-specific promoter, so that synthesis of the recombinase 

20 takes place only in cells or tissues in which that promoter is active. 

[00159] Gene and transcript silencing. As described in more detail below, the 
invention provides methods of reducing or inhibiting the expression of target genes 
and/or transcripts (which need not necessarily encode proteins) by exploiting the 
phenomenon of RNA interference (RNAi). For example, the invention provides a 

25 method of inhibiting or reducing the expression of a target transcript in a cell 

comprising delivering a lentiviral vector (e.g., a lentiviral transfer plasmid or lentiviral 
particle) to the cell, wherein presence of the lentiviral vector within a cell results in 
synthesis of one or more RNAs that self-hybridize or hybridize with each other to 
form a short hairpin RNA or short interfering RNA that is targeted to the target 

30 transcript. Such lentiviral expression vectors may be used therapeutically to silence 
disease-causing genes and/or render cells resistant to infectious organisms. In 
addition, lentiviral expression vectors may facilitate the creation of animals deficient 
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in immunogenic xenoantigens as sources of organs for organ transplantation. It will 
be appreciated that in those embodiments of the invention in which the nucleic acid 
segment that provides a template for synthesis of the one or more RNAs that self- 
hybridize or hybridize with each other to form an shRNA or siRNA is floxed, 
5 inhibition of the target transcript may be reversed by expressing Cre, thereby excising 
the template for the siRNA or shRNA. Thus the invention allows conditional and 
tissue-specific expression of target transcripts in cells, tissues, or organisms. RNAi 
and methods of using the plasmids and expression systems of the invention for 
achieving RNAi are described below. 

10 [00160] RNA interference 

[00161] Small inhibitory RNAs were first discovered in studies of the phenomenon 
of RNA interference (RNAi) in Drosophila, as described, for example, in WO 
01/75164, etc. It was found that, in Drosophila, long double-stranded RNAs are 
processed by an RNase Ill-like enzyme called DICER (Bernstein et al., Nature 

15 409:363, 2001) into smaller dsRNAs comprised of two 21 nt strands, each of which 
has a 5 5 phosphate group and a 3 5 hydroxyl, and includes a 19 nt region precisely 
complementary with the other strand, so that there is a 19 nt duplex region flanked by 
2 nt-3' overhangs (see Figure 11). These small dsRNAs (siRNAs) act to silence 
expression of any gene that includes a region complementary to one of the dsRNA 

20 strands, presumably because a helicase activity unwinds the 19 bp duplex in the 

siRNA, allowing an alternative duplex to form between one strand of the siRNA and 
the target transcript. This new duplex then guides an endonuclease complex, RISC, to 
the target RNA, which it cleaves ("slices") at a single location, producing unprotected 
RNA ends that are promptly degraded by cellular machinery (see Figure 12). 

25 [00162] Homologs of the DICER enzyme are found in diverse species ranging 
from C. elegans to humans (Sharp, Genes Dev. 15;485, 2001; Zamore, Nat Struct 
Biol 8:746, 2001), raising the possibility that an RNAi-like mechanism might be able 
to silence gene expression in a variety of different cell types including mammalian, or 
even human, cells. However, long dsRNAs (e.g., dsRNAs having a double-stranded 

30 region longer than about 30 - 50 nucleotides) are known to activate the interferon 

response in mammalian cells. Thus, rather than achieving the specific gene silencing 
observed with the Drosophila RNAi mechanism, the presence of long dsRNAs into 
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mammalian cells would be expected to lead to interferon-mediated noii-specific 
suppression of translation, potentially resulting in cell death. Long dsRNAs are 
therefore not thought to be useful for inhibiting expression of particular genes in 
mammalian cells. 

5 [00163] In contrast, siRNAs, when present in mammalian cells, can effectively 
reduce the expression of target transcripts and genes in a specific manner without 
activating the anti-viral response (9, 10). Preferred siRNAs typically include a base- 
paired region approximately 19 nt long, and may further comprise one or more single- 
stranded regions, typically 3' overhangs on one or both strands. Figures 1 1 and 13 

10 presents various structures that can be utilized to mediate RNA interference. Figure 
1 1 shows the siRNA structure found to be active in the Drosophila system and likely 
represents the species that is active in mammalian cells. This structure consists of two 
21 nt strands having a complementary core region of 19 nt and 2 nt 3' overhangs at 
each end of the double-stranded region. Figures 13A, 13B, 13C, and 13D represent 

15 additional structures that may be used to mediate RNA interference. These hairpin 
(stem-loop) structures may function directly as inhibitory RNAs or may be processed 
intracellularly to yield an siRNA structure such as that depicted in Figure 1 1 . 
[00164] Many different RNA species having structures such as these have been 
introduced into mammalian cells and have been shown to reduce expression of target 

20 transcripts. For example, siRNAs targeted to transcripts encoding the HIV Gag 

protein or the the HIV-1 cellular receptor CD4 reduced the level of the corresponding 
mRNAs and proteins Gag in cells infected with HIV (Novina, C, et aL, Nat Med, 
8(7):681-6, 2002), resulting in inhibition of virus production. Studies such as this, 
demonstrating siRNA-mediated inhibition of cellular genes as well as genes of 

25 infectious organisms, demonstrate the therapeutic potential of RNA interference for a 
wide variety of conditions. In addition, the ability to selectively reduce or eliminate 
expression of particular genes has profound implications for the study of gene 
function. 

[00165] In general, preferred siRNAs reduce the target transcript level or level of 
30 the encoded protein at least about 2 fold, preferably at least about 5 fold, more 

preferably at least about 10 fold, at least about 25 fold, at least about 50 fold or to an 
even greater degree relative to the level that would be present in the absence of the 
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inhibitory RNA. In selecting a target sequence for any particular transcript it may be 
desirable to test a variety of siRNAs in order to identify one with an appropriate 
efficacy. 

[00166] In general, an siRNA includes a double-stranded region (the "inhibitory 
5 region"), one strand of which is substantially complementary to a portion of the target 
transcript, so that a precise hybrid can form in vivo between one strand of the siRNA 
and the target transcript. The portion of the target transcript to which the siRNA 
strand hybridizes may be referred to as the target or targeted portion or site. In certain 
preferred embodiments of the invention, the relevant inhibitor region of the siRNA is 

10 perfectly complementary with the target transcript; in other embodiments, one or 
more non-complementary residues are located at or near the ends of the 
siRNA/template duplex or elsewhere. As will be appreciated by those of ordinary 
skill in the art, it is generally preferred that mismatches in the central portion of the 
siRNA/template duplex be avoided (see, for example, Elbashir et al., EMBO J. 

15 20:6877,2001). 

[00167] Generally any portion of a target transcript may be selected as the target 
site, to which the antisense strand of the siRNA will be complementary. It may be 
preferable to select siRNAs that hybridize with a target site that includes exonic 
sequences in the target transcript or hybridizes exclusively with exonic sequences. 

20 Hybridization with intronic sequences is not excluded, but generally appears not to be 
preferred in mammalian cells. An siRNA that hybridizes with a target site that 
includes only sequences within a single exon may be selected, or the target site may 
be created by splicing or other modification of a primary transcript. Any site that is 
available for hybridization with an siRNA antisense strand, resulting in slicing and 

25 degradation of the transcript may be utilized in accordance with the present invention. 
Nonetheless, those of ordinary skill in the art will appreciate that, in some instances, it 
may be desirable to select particular regions of target gene transcript as siRNA 
hybridization targets. For example, it may be desirable to avoid (i) sections of target 
transcript that may be shared with other transcripts whose degradation is not desired; 

30 (ii) sections of target transcript that are identical or homologous to other transcripts 
whose degradation is not desired. In general, coding regions and regions closer to the 
3' end of the transcript than to the 5' end are preferred. The 3' portion of target 
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transcripts may be less likely to exhibit secondary structure that may inhibit or 
interfere with siRNA activity, e.g., by reducing accessibility. 

[00168] In general, preferred siRNA sequences have a GC content between 30 and 
70% or, preferably, between 40 and 60%. In general, it is preferred to avoid target 
5 sequences that contain strings of >2 identical nucleotides (e.g., AAA, GGGG). 

siRNA sequences may conveniently be identified by scanning the cDNA sequence 
from 5' to 3' until an appropriate 19 nucleotide target is identified. If it is desired to 
include a 3' overhang in the antisense strand, the 19 nt sequence should be preceded 
by nucleotides complementary to the desired 3' overhang. For example, according to 
10 certain embodiments of the invention an siRNA sequence should correspond to: 
AAN19. 

[00169] Certain siRNAs hybridize to a target site that includes or consists entirely 
of 3 5 UTR sequences. Such siRNAs may tolerate a larger number of mismatches in 
the siRN A/template duplex, and particularly may tolerate mismatches within the 

15 central region of the duplex. In fact, some mismatches may be desirable as 

siRNA/template duplex formation in the 3' UTR may inhibit expression of a protein 
encoded by the template transcript by a mechanism related to but distinct from classic 
RNA inhibition. In particular, there is evidence to suggest that siRNAs that bind to 
the 3' UTR of a template transcript may reduce translation of the transcript rather than 

20 decreasing its stability. Specifically, as shown in Figure 14, the DICER enzyme that 
generates siRNAs in the Drosophila system discussed above and also in a variety of 
organisms, is known to also be able to process a small, temporal RNA (stRNA) 
substrate into an inhibitory agent that, when bound within the 3' UTR of a target 
transcript, blocks translation of the transcript (see Grishok, A., et al., Cell 106, 23-24, 

25 2001; Hutvagner, G., et al., Science, 293, 834-838, 2001; Ketting, R., et al., Genes 
Dev., 15, 2654-2659). For the purposes of the present invention, any partly or fully 
double-stranded short RNA as described herein, one strand of which binds to a target 
transcript and reduces its expression (i.e., reduces the level of the transcript and/or 
reduces synthesis of the polypeptide encoded by the transcript) is considered to be an 

30 siRNA, regardless of whether the RNA acts by triggering degradation, by inhibiting 
translation, or by other means. In certain preferred embodiments of the invention, 
reducing expression of the transcript involves degradation of the transcript. In 
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addition any precursor structure (e.g., a short hairpin RNA, as described herein) that 
may be processed in vivo (i.e., within a cell or organism) to generate such an siRNA is 
useful in the practice of the present invention. 

[00170] Use of RNAi in mammalian cells, tissues, and organisms is currently 
5 restricted by the limited delivery methods available. siRNAs can be delivered to cells 
by various means, such as electroporation (1 1), use of lipofectants (10), or expression 
of short hairpin RNAs (shRNAs) in cells from a plasmid template (11-16). sliRNAs 
are precursors of siRNAs, and typically comprise dsRNA stretches of at least 19 bp 
separated by a loop of several non self-complementary nucleotides. shRNAs adopt 

10 stem-loop structures, thought to be recognized and processed into siRNAs by the 
conserved cellular RNAi machinery (17; Ketting, R., et al., Genes Dev., 15, 2654- 
2659). While the approaches mentioned above have been successful at targeting gene 
expression in cell culture systems, in general they are not as readily applicable to 
primary cells, which are difficult to transfect by standard methods such as those 

15 mentioned above. Their use to target gene expression in mammalian subjects is also 
problematic. A further limitation of introducing siRNAs into cells by standard means 
is that the inhibitory (knockdown) effect is transient, as mammalian cells appear to 
lack the siRNA amplification mechanisms that confer RNAi potency and longevity in 
lower organisms (10). 

20 [00171] The present invention encompasses the recognition that use of lentiviral 

expression systems offer a means of overcoming problems associated with delivery of 
siRNAs into mammalian cells and tissues, including primary mammalian cells and 
tissues, nondividing cells (including neurons and naive T cells), and cells at early 
stages of development such as embryonic cells (including embryonic stem cells). The 

25 invention further emcompasses the recognition that use of lentiviral vectors offers a 
means of overcoming problems associated with delivery of siRNAs into mammalian 
subjects. 

[00172] The invention provides lentiviral vectors and expression systems capable 
of directing transcription of RNAs that hybridize to form shRNAs and/or siRNAs in 
30 mammalian cells. In particular, the invention provides a lentiviral vector comprising 
a nucleic acid segment operably linked to a promoter, so that transcription from the 
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promoter results in synthesis of one or more RNAs that self-hybridize or hybridize 
with each other to form an shRNA or siRNA targeted to a target transcript. 
[00173] The invention further provides a three-plasmid lenti viral expression system 
comprising (i) a lentiviral transfer plasmid comprising a nucleic acid segment 
5 operably linked to a promoter, so that transcription from the promoter results in 
synthesis of one or more RNAs that self-hybridize or hybridize with each other to 
form an shRNA or siRNA targeted to a target transcript; (ii) a packaging plasmid; and 
(iii) an Env-coding plasmid. In addition, the invention provides an infectious 
lentiviral particle comprising a nucleic acid segment operably linked to a promoter, so 

10 that transcription from the promoter results in synthesis of one or more RNAs that 

self-hybridize or hybridize with each other to form an shRNA or siRNA targeted to a 
target transcript. In other words, the nucleic acid segment(s) provides template(s) for 
synthesis of an RNA that self-hybridizes to form an shRNA or for synthesis of two 
complementary RNAs that hybridize to form an siRNA. 

15 [00174] According to certain embodiments of the invention the lentiviral vector 
comprises a nucleic acid segment which, when transcribed, produces an RNA that 
comprises two complementary elements that hybridize to one another to form a stem 
and a loop. The stem-loop structure is also referred to as a hairpin. Figure 15A 
J schematically depicts such a nucleic acid segment 10 operably linked to a promoter 

20 element 20. Nucleic acid segment 10 comprises complementary elements 30 and 40, 
separated by element 50. Preferably the nucleic acid includes a transcriptional 
terminator element 60, e.g., a terminator for RNA polymerase III such as a string of T 
residues. However, such a terminator element may also be provided within a vector 
into which the nucleic acid segment is inserted. Figure 15B schematically depicts an 

25 RNA 70 transcribed from nucleic acid segment 10 prior to hybridization. RNA 70 
comprises self-complementary elements 80 and 90. 

[00175] Figure 15C schematically depicts the RNA following hybridization of the 
complementary portions, resulting in formation of stem 100 and loop 110. 
Termination within the terminator sequence results in a 3' overhang 120, which may 
30 comprise one or more U residues. Preferably, the stem is approximately 19 bp long, 
the loop is about 1-20, more preferably about 4-12, and most preferably about 6-10 
nt long and/or the overhang is about 1-20, and more preferably about 2-6 nt long. In 
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certain preferred embodiments of the invention the overhang is 2 nt long. In certain 
embodiments of the invention the stem is minimally 19 nucleotides in length and may 
be up to approximately 29 nucleotides in length. One of ordinary skill in the art will 
appreciate that loops of 4 nucleotides or greater are less likely to be subject to steric 
5 constraints than are shorter loops and therefore may be preferred. 

[00176] Figure 17A schematically depicts the sequence of a nucleic acid 
comprising a segment which, when transcribed, produces an RNA that comprises two 
complementary elements that hybridize to one another to form a stem and a loop, 
inserted into the MCS of a lenti viral transfer plasmid of the invention. 

10 Complementary portions are indicated with arrows in opposite orientation to one 
another. Figure 17B depicts a nucleic acid which, when transcribed, results in an 
RNA targeted to the CD8 molecule. Figure 17C depicts the shRNA that results 
following hybridization of the complementary portions of an RNA transcribed from 
the nucleic acid in Figure 17B. The RNA forms a stem-loop structure in which the 

15 stem is targeted to CD8. As described in more detail in Example 3, the inventors have 
shown that lentiviral transfer plasmids comprising a heterologous nucleic acid whose 
sequence includes the CD8 stem-loop sequence inhibit expression of CD 8 when 
introduced into cells. In addition, as described in Examples 4 and 5, lentiviral 
particles comprising a heterologous nucleic acid whose sequence includes the CD8 

20 stem-loop sequence inhibit expression of CD8 at both the mRNA and protein level 
when introduced into cells. Furthermore, the inhibition of expression persisted over 
the length of the experiment (1 month), demonstrating that RNAi mediated by the 
integrated lentivirus was stable. The inventors were unable to detect shRNA 
structures in the infected cells but were able to detect approximately 21 nucleotide- 

25 long RNAs comprising the CD8 stem loop sequence and having a typical siRNA 
structure. While not wishing to be bound by any theory, this results confirms the 
hypothesis that shRNAs are processed into siRNAs within the cell. The inventors 
also demonstrated that shRNA-mediated inhibition of CD8 was specific. In 
particular, shRNAs targeted to the mouse CD8 RNA did not inhibit expression of 

30 human CD8. 

[00177] The invention therefore provides a lentiviral vector, e.g., a lentiviral 
transfer plasmid or lentiviral particle comprising a nucleic acid segment that provides 
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a template for synthesis of one or more RNAs that self-hybridize or hybridize with 
each other to form an shRNA or siRNA, wherein the shRNA or siRNA is targeted to a 
target transcript and reduces expression of the transcript. For example, the invention 
provides a lentiviral transfer plasmid comprising the following elements: a nucleic 
5 acid including (i) a functional packaging signal; (ii) a multiple cloning site (MCS) 

into which a nucleic acid may be inserted; (iii) at least one additional element selected 
from the group consisting of: a second MCS, an HIV FLAP element, a heterologous 
promoter, a heterologous enhancer, an expression-enhancing posttranscriptional 
regulatory element, a target site for a site-specific recombinase, and a self-inactivating 

10 (SIN) LTR; and (iv) a nucleic acid segment that provides a template for synthesis of 
an shRNA or siRNA, which shRNA or siRNA is targeted to a target transcript and 
reduces expression of the transcript. In certain preferred embodiments of the 
invention the nucleic acid segment provides a template for synthesis of an RNA that 
self-hybridizes to form an shRNA. 

15 [00178] Any of the various embodiments of the elements included in the lentiviral 
transfer plasmid or lentiviral particle may be selected as described above. In 
particular, the invention provides a lentiviral transfer plasmid comprising the 
following elements: a nucleic acid including (i) a functional packaging signal; (ii) a 
multiple cloning site (MCS) into which a nucleic acid may be inserted; (iii) a second 

20 MCS; (iv) an HIV FLAP element; (v) a WRE; (vi) two loxP sites; (vii) a self- 
inactivating (SIN) LTR; and (viii) a nucleic acid segment operably linked to a PolIII 
promoter, wherein the nucleic acid segment provides a template for synthesis of one 
or more RNAs that self-hybridize or hybridize with each other to form an shRNA or 
siRNA, which shRNA or siRNA is targeted to a target transcript and reduces 

25 expression of the transcript. In certain preferred embodiments of the invention the 
nucleic acid segment provides a template for synthesis of an RNA that hybridizes to 
form an shRNA. 

[001 79] Identification of sequences for design of the stem portion of an shRNA 
may be performed as described above for siRNAs. See also the Web sites having 
30 URLs 

www.mpibpc.gwdg.de/abteilungen/1 00/1 05/sirna.html and 

katahdin.chsl.org:9331/RNAi (visited October 23, 2002). The first step is to search 
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for potential target sequences, e.g., by scanning the cDNA. According to certain 
embodiments of the invention a potential target sequence corresponds to any sequence 
of the form GNis- According to other embodiments of the invention a potential target 
sequence corresponds to a sequence of the form AAGNig. According to yet other 
5 embodiments of the invention a potential target sequence corresponds to a sequence 
of the form AAGNisTT. Once a potential target is selected, the sequence GNi8 is 
used as the sequence for the stem (duplex) portion of the shRNA. Thus in certain 
embodiments of the invention the GN18 is preferably be surrounded by AA — TT in 
the context of the mRNA. Where the U6 promoter is used, a 5 5 guanine is generally 

10 required due to the constraints of this promoter. It may be useful to test 4-5 targets for 
each transcript or gene of interest. It may be desirable to perform a database search 
(e.g., BLAST search) using the GNi 8 sequences to verify that the sequence is unique 
in order to avoid silencing other genes in addition to the target gene. As lentivirus 
pseudotyped with VSV-G is capable of infecting human cells, if the lentivirus is not 

15 intended for use in humans it may be deisrable to determine if there are human genes 
that may be silenced. If so, it may be preferable to avoid sequences that would target 
such genes. 

[00180] According to certain embodiments of the invention the sequence 
TTCAAGAGA (SEQ ID NO: 10) is selected for the loop. Thus to design the 

20 complete hairpin sequence according to certain embodiments of the invention, a 19 nt 
sequence suitable as the inhibitory portion of a typical siRNA is selected, optionally 
including an additional two nucleotides such as AA at the 5' end in order to generate a 
3' UU overhang in the resulting shRNA. A loop sequence is added at the 3' end of 
the 19 (or 21) nt sequence, followed by a sequence complementary to the 19 nt (or 21) 

25 sequence, resulting in a stem-loop after hybridization. See Example 3 for additional 
information. Any of a variety of other sequences may be selected for the loop 
including, but not limited to, loops used in the shRNAs described in Brummelkamp, 
et aL, Paddison, et aL, Sui, et aL, Yui, et aL, or Paul, et aL 

[00181] The invention provides a method of reducing or inhibiting expression of a 
30 target transcript in a cell comprising: (i) delivering a lentiviral vector to the cell, 

wherein presence of the lentiviral vector within the cell results in transcription of one 
or more RNAs that self-hybridize or hybridize with each other to form an shRNA or 
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siRNA that is targeted to the target transcript and reduces expression of the transcript. 
The lentiviral vector comprises a nucleic acid segment that provides a template for 
synthesis of one or more RNAs that self-hybridize or hybridize with each other to 
form an shRNA or siRNA that is targeted to the transcript and reduces expression of 
5 the transcript. In certain preferred embodiments of the invention the nucleic acid 
provides a template for synthesis of an RNA that hybridizes to form an shRNA. 
[00182] According to certain embodiments of the invention the cell is a 
mammalian cell. Any of the lentiviral transfer plasmids or lentiviral particles 
described above may be used, wherein presence of the plasmid or particle in a cell 

10 provides a template for synthesis of one or more RNAs that self-hybridize or 

hybridize with each other to form a shRNA or siRNA targeted to the transcript of 
interest. According to certain embodiments of the invention the delivering step 
comprises delivering the lentiviral transfer plasmid or lentiviral particle to a 
mammalian subject, thereby delivering the lentiviral transfer plasmid or lentiviral 

1 5 particle to a cell that is present within the body of the subject. According to certain 
embodiments of the invention the cell is a primary cell. By "primary cell" is meant a 
cell that has been removed from the body of a subject and maintained in tissue culture 
for less than approximately 1, 2, 3, 4, or 5 doubling periods or a non-immortalized 
cell. According to certain embodiments of the invention the mammalian cell is a 

20 nondividing cell, e.g., a terminally differentiated T cell, neuron, hepatocyte, retinal 
cell, skeletal myocyte, cardiac myocyte, keratinocyte, macrophage, etc. The 
mammalian cell may be a human cell or a nonhuman (e.g., mouse or rat) cell. 
According to certain embodiments of the invention the mammalian cell is an 
embryonic cell or an embryonic stem cell. 

25 [001 83] The invention further provides a method for reversibly inhibiting or 

reducing expression of a target transcript in a cell comprising: delivering a lentiviral 
vector to the cell, wherein the lentiviral vector comprises a nucleic acid segment that 
provides a template for synthesis of one or more RNAs that self-hybridize or 
hybridize with each other to form an shRNA or siRNA, which shRNA or siRNA is 

30 targeted to the target transcript and reduces expression of the transcript, wherein the 
nucleic acid segment is located between sites for a site-specific recombinase; and (ii) 
inducing expression of the site-specific recombinase within the cell, thereby 
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preventing synthesis of at least one of the RNAs. According to certain embodiments 
of the invention the cell is a mammalian cell. According to certain embodiments of 
the invention the recombinase is Cre. According to certain embodiments of the 
invention the step of inducing the site-specific recombinase comprises introducing a 
5 vector encoding the site-specific recombinase into the cell. According to other 

embodiments of the invention a nucleic acid encoding the site-specific recombinase is 
operably linked to an inducible promoter, and the inducing step comprises inducing 
the promoter within a cell containing the nucleic acid, as described above. 
[00184] In addition, the invention provides a variety of methods for reversibly 

1 0 inhibiting or reducing expression of a target transcript in a conditional and/or tissue- 
specific manner. For example, the invention provides a method for reversibly 
inhibiting or reducing expression of a transcript in a mammal in a cell type specific or 
tissue-specific manner comprising: (i) delivering a lentiviral vector to cells of the 
mammal, wherein the lentiviral vector comprises a nucleic acid segment that provides 

15 a template for synthesis of one or more RNAs that self-hybridize or hybridize with 
each other to form an shRNA or siRNA, which shRNA or siRNA is targeted to the 
target transcript and reduces expression of the transcript, and wherein the nucleic acid 
segment is located between sites for a site-specific recombinase; and (ii) inducing 
expression of the site-specific recombinase in a subset of the cells of the mammal, 

20 thereby preventing synthesis of at least one of the RNAs within those cells. 

According to certain embodiments of the inventive methods the recombinase is Cre. 
According to certain embodiments of the invention the step of inducing the site- 
specific recombinase comprises introducing a vector encoding the site-specific 
recombinase into a subset of the cells of the subject, e.g., by utilizing a vector that 

25 requires a receptor present only on a subset of the cells. According to other 

embodiments of the invention a nucleic acid encoding the site-specific recombinase is 
operably linked to an inducible promoter, and the inducing step comprises inducing 
the promoter within cells containing the nucleic acid, as described above, whereby 
expression of the target transcript is restored only in cells or tissues in which the 

30 promoter is active. 

[00185] In certain embodiments of the invention the nucleic acid encoding the site- 
specific recombinase is operably linked to a cell type or tissue-specific promoter, so 
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that synthesis of the recombinase takes place only in cells or tissues in which that 
promoter is active, whereby expression of the target transcript is restored only in cells 
or tissues in which the promoter is active. 

[00186] In certain preferred embodiments of the invention, the promoter utilized to 
5 direct expression of the one or more RNAs that self-hybridize or hybridize with each 
other to form an shRNA or siRNA is a promoter for RNA polymerase III (Pol III). 
Pol III directs synthesis of small transcripts that terminate within a stretch of 4-5 T 
residues. Certain Pol III promoters such as the U6 or HI promoters do not require 
c^-acting regulatory elements (other than the first transcribed nucleotide) within the 

10 transcribed region and thus are preferred according to certain embodiments of the 
invention since they readily permit the selection of desired RNA sequences. In the 
case of naturally occurring U6 promoters the first transcribed nucleotide is guanosine, 
while in the case of naturally occurring HI promoters the first transcribed nucleotide 
is adenine. (See, e.g., Medina MF and Joshi S., Curr Opin Mol Ther 1999 

15 Oct;l(5):580-94; Yu, J., et al., Proc. Natl. Acad. Set, 99(9), 6047-6052 (2002); Sui, 
G., et al., Proc. Natl. Acad. Sci., 99(8), 5515-5520 (2002); Paddison, P., et al., Genes 
and Dev., 16, 948-958 (2002); Brummelkamp, T., et al., Science, 296, 550-553 
(2002); Miyagashi, M. and Taira, K., Nat. Biotech., 20, 497-500 (2002); Paul, C, et 
al., Nat. Biotech., 20, 505-508 (2002); Tuschl, T., et al., Nat. Biotech., 20, 446-448 

20 (2002)). Thus in certain embodiments of the invention, e.g., where transcription is 

driven by a U6 promoter, the 5' nucleotide of preferred RNA sequences for formation 
of shRNAs or siRNAs is G. In certain other embodiments of the invention, e.g., 
' where transcription is driven by an HI promoter, the 5' nucleotide may be A. The 
lentiviral transfer plasmid may be created by inserting a cassette comprising the RNA 

25 sequence into a transfer plasmid optimized for RNAi that already contains a suitable 
promoter, e.g., a plasmid such as pLL3.7. Alternately, a cassette comprising the RNA 
sequence operably linked to a suitable promoter may be inserted into a transfer 
plasmid that lacks such a promoter, e.g., a plasmid such as pLL3.0. 
[00187] The invention thus encompasses administration of a lentiviral vector to a 

30 cell, e.g., a mammalian cell, to inhibit or reduce expression of any target transcript or 
gene, wherein the lentiviral vector comprises a nucleic acid segment that provides a 
template for synthesis of one or more RNAs that self-hybridize or hybridize to form 
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an shRNA or siRNA that is targeted to the target transcript or gene. In general, the 
nucleic acid segment may provide a template for synthesis of any RNA structure 
capable of being processed in vivo to a shRNA or siRNA, so long as the RNA does 
not induce other negative events such as induction of the interferon response. In 
5 certain preferred embodiments of the invention the nucleic acid segment provides a 
template for synthesis of an RNA that self-hybridizes to form an shRNA targeted to 
the target transcript. 

[00188] As discussed above, in addition to their use for synthesis of RNAs that 
self-hybridize to form shRNAs, the lentiviral vectors of the invention may be used for 

10 synthesis of various other RNAs that mediate RNAi. In particular, two separate 

approximately 21 nt RNA strands may be generated, each of which contains a 19 nt 
region complementary to the other, and the individual strands may hybridize together 
to generate an siRNA structure. Accordingly, the invention encompasses a lentiviral 
vector comprising two transcribable regions, each of which provides a template for 

15 synthesis of a transcript containing a region complementary with the other. Generally 
each transcript will be approximately 21 nt in length and the complementary regions 
will be approximately 19 nt in length, as described above. 
[00189] In addition, the invention provides a lentiviral vector that contains 
oppositely directed promoters flanking a nucleic acid segment and positioned so that 

20 two different transcripts, approximately 21 nt in length and having complementary 
regions approximately 19 nt in length, are generated. It will be appreciated that 
appropriate terminators should be supplied in these cases. In cases in which the RNA 
structure undergoes one or more processing steps, those of ordinary skill in the art 
will appreciate that the nucleic acid segment will preferably be designed to include 

25 sequences that may be necessary for processing of the RNA. Figure 1 6 presents a 
schematic diagram of such a plasmid. 

[00190] A large number of variations on the above are possible. For example, the 
lentiviral vector may comprise multiple nucleic acid segments, each of which 
provides a template for synthesis of one or more RNAs that self-hybridize or 
30 hybridize with each other to form shRNAs or siRNAs, which shRNAs or siRNAs may 
target the same transcript or different transcripts. In addition, according to certain 
embodiments of the invention the nucleic acid segment provides a template for 
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synthesis of a plurality of RNAs that self-hybridize or hybridize with each other to 
form a plurality of siRNAs or siRNA precursors. For example, a single promoter may 
direct synthesis of a single RNA transcript containing multiple self-complementary 
regions, each of which may hybridize to generate a plurality of stem-loop structures. 
5 These structures may be cleaved in vivo, e.g., by DICER, to generate multiple 
different siRNAs. It will be appreciated that such transcripts preferably contain a 
termination signal at the 3' end of the transcript but not between the individual siRNA 
units. 

[00191] The present invention encompasses any cell manipulated to contain an 
10 inventive lentiviral transfer plasmid, lentiviral particle, or lenti viral genome derived 
therefrom (e.g., a provirus), wherein the lentiviral transfer plasmid, lentiviral particle, 
or lentiviral genome provides a template for synthesis of one or more RNAs that self- 
hybridize or hybridize to form an shRNA or siRNA. Preferably, the cell is a 
mammalian cell. According to certain embodiments of the invention the cell is a 
15 human cell. Those of ordinary skill in the art will appreciate that intracellular 

expression of RNAs that self-hybridize or hybridize with each other to form shRNAs 
or siRNAs according to the present invention may allow the production of cells that 
produce the shRNA or siRNA over long periods of time (e.g., greater than a few days, 
preferably at least several months, more preferably at least a year or longer, possibly a 
20 lifetime). 

[00192] In certain embodiments of the invention, the cells are non-human cells 
within an organism. For example, the present invention encompasses transgenic 
animals the cells of which contain an inventive lentiviral transfer plasmid, lentiviral 
particle, or lentiviral genome derived therefrom, wherein the lentiviral transfer 

25 plasmid, lentiviral particle, or lentiviral genome provides a template for synthesis of 
one or more RNAs that self-hybridize or hybridize to form an shRNA or siRNA in 
one or more cell types or tissues of the transgenic animal. The invention therefore 
provides a transgenic animal, one or more cells of which comprise a heterologous 
nucleic acid segment provided by a lentiviral vector, wherein the lentiviral vector 

30 comprises (i) a functional packaging signal; (ii) a multiple cloning site (MCS); and 
(iii) at least one additional element selected from the group consisting of: a second 
MCS, a second MCS into which a heterologous promoter or promoter-enhancer is 
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inserted, an HIV FLAP element, an expression-enhancing p o s ttr ans crip tional 
regulatory element, a target site for a site-specific recombinase, and a self-inactivating 
(SIN) LTR . According to certain embodiments of the invention the cells of the 
transgenic animal contain a heterologous nucleic acid segment that comprises sites for 
5 a site-specific recombinase. 

[00193] As described in Example 7, the inventors have generated transgenic mice 
using a variety of lentiviral particles each comprising a first heterologous nucleic acid 
segment encoding the fluorescent protein GFP 5 and also a second heterologous 
nucleic acid segment that provides a template for synthesis of an RNA that self- 

10 hybridizes to form an shRNA targeted to a target transcript. The lentiviral particles 
were able to induce expression of GFP within embryonic stem cells (ES cells), and 
these ES cells gave rise to transgenic animals whose cells expressed GFP. 
Furthermore, expression of the particular target transcript corresponding to the second 
nucleic acid segment was reduced or inhibited in cells of the transgenic mice. These 

15 results demonstrate that the lentiviral transfer plasmids and lentiviral particles of the 
invention may be used to generate transgenic animals in which expression of a target 
transcript is reduced or inhibited. It is noted that the lentiviral vectors of the invention 
may thus generally be used a bifunctional vectors, leading both to expression of a 
heterologous nucleic acid and silencing of an endogenous gene. 

20 [00194] Kits 

[00195] The invention provides a variety of kits comprising one or more of the 
lentiviral transfer plasmids of the invention. For example, the invention provides a 
kit comprising (a) a lentiviral transfer plasmid comprising a nucleic acid sequence 
including (i) a functional packaging signal; (ii) a multiple cloning site (MCS) into 

25 which a heterologous gene may be inserted; and (iii) at least one additional element 
selected from the group consisting of: a second MCS, an HIV FLAP element, a 
heterologous promoter, a heterologous enhancer, an expression-enhancing 
posttranscriptional regulatory element, a target site for a site-specific recombinase, 
and a self-inactivating (SIN) LTR; and one or more of the following items: (i) a 

30 packaging mix comprising one or more plasmids that collectively provide nucleic acid 
sequences coding for retroviral or lentiviral Gag and Pol proteins and an envelope 
protein. The packaging mix may contain two or more plasmids. According to certain 
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embodiments of the invention the packaging mix includes two plasmids, one of which 
provides nucleic acid sequences coding for Gag and Pol and the other of which 
provides nucleic acid segments coding for an envelope protein; (ii) cells (e.g., a cell 
line) that are permissive for production of lentiviral particles such as 293T cells; (iii) 
5 packaging cells, e.g., a cell line that is permissive for production of lentiviral particles 
and provides the proteins Gag, Pol, Env, and, optionally, Rev; (iv) cells suitable for 
use in titering lentiviral particles; a transfection-enhancing agent such as 
Lipofectamine; (v) a selection agent such as an antibiotic, preferably corresponding to 
an antibiotic resistance gene in the lentiviral transfer plasmid; (vi) instructions for use; 
10 (vii) a lentiviral transfer plasmid comprising a heterologous nucleic acid segment such 
as a reporter gene that may serve as a positive control (referred to as a "positive 
control plasmid"). 

[00196] According to certain embodiments of the invention the kit contains a set of 
lentiviral transfer plasmids comprising a variety of different heterologous promoters 
15 and/or reporter genes. For example, the kit may contain a set of two or more vectors 
selected from the group consisting of the plasmids of: SEQ ID NO: 2, SEQ ID NO: 3, 
SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, and 
SEQ ID NO: 9. 

[00197] Therapeutic Applications and Pharmaceutical Formulations 
20 [00198] The lentiviral vectors of the invention are useful for a wide variety of 

therapeutic applications. In particular, they are useful in any context for which gene 
therapy is contemplated. For example, lentiviral vectors comprising a heterologous 
nucleic acid segment operably linked to a promoter are useful for any disease or 
clinical condition associated with reduction or absence of the protein encoded by the 
25 heterologous nucleic acid segment, or any disease or clinical condition that can be 
effectively treated by augmenting the expression of the encoded protein within the 
subject. For example, lentiviral vectors comprising a nucleic acid segment encoding 
the cystic fibrosis transmembrane conductance regulator (CFTR) or encoding otl- 
antitrypsin may be used for the treatment of cystic fibrosis and a 1 -antitrypsin 
30 deficiency, respectively. Lentiviral vectors comprising a nucleic acid segment 

encoding Factor VIII or Factor IX may be used for treatment of hemophilia A or B 5 
respectively. See the Web site having URL www.wiley.co.ulc/genetherapy/clinical/ 
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(visited October 19, 2002) for a representative list of current gene therapy clinical 
trials involving expression of a therapeutic protein in a subject in need of treatment. 
[00199] Inventive lentiviral vectors capable of causing intracellular synthesis of 
inhibitory RNAs (siRNAs or shRNAs) are useful in treating any disease or clinical 
5 condition associated with overexpression of a transcript or its encoded protein in a 
subject, or any disease or clinical condition that may be treated by causing reduction 
of a transcript or its encoded protein in a subject. For example, many cancers are 
associated with overexpression of oncogene products. Delivering a lentiviral vector 
that provides a template for synthesis of one or more RNAs that self-hybridize or 

10 hybridize with each other to form an shRNA or siRNA targeted to the transcript 

encoding the oncogene product may be used to treat such cancers. The high degree of 
specificity achieved by RNA interference suggests that it is possible to selectively 
target transcripts containing single base pair mutations while not interfering with 
expression of the normal cellular allele. Lenviral vectors that provide a template for 

1 5 synthesis of one or more RNAs that self-hybridize or hybridize with each other to 

form an shRNA or siRNA targeted to a transcript encoding a cytokine may be used to 
regulate immune system responses (e.g., responses responsible for organ transplant 
rejection, allergy, autoimmune diseases, inflammation, etc.). Lentiviral vectors that 
provide a template for synthesis of one or more RNAs that self-hybridize or hybridize 

20 with each other to form an shRNA or siRNA targeted to a transcript of an infectious 
agent or targeted to a cellular transcript whose encoded product is necessary for or 
contributes to any aspect of the infectious process may be used in the treatment of 
infectious diseases. 

[00200] Gene therapy protocols may involve administering an effective amount of 
25 a lentiviral vector whose presence within a cell results in production of a therapeutic 
siRNA or shRNA to a subject either before, substantially contemporaneously, with, or 
after the onset of a condition to be treated. Another approach that may be used 
alternatively or in combination with the foregoing is to isolate a population of cells, 
e.g., stem cells or immune system cells from a subject, optionally expand the cells in 
30 tissue culture, and administer a lentiviral vector whose presence within a cell results 
in production of a therapeutic siRNA or shRNA to the cells in vitro. The cells may 
then be returned to the subject, where, for example, they may provide a population of 
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cells that produce a therapeutic shRNA or siRNA, or that are resistant to infection by 
an infectious organism, etc. Optionally, cells expressing the therapeutic shRNA or 
siRNA can be selected in vitro prior to introducing them into the subject. In some 
embodiments of the invention a population of cells, which may be cells from a cell 
5 line or from an individual other than the subject, can be used. Methods of isolating 
stem cells, immune system cells, etc., from a subject and returning them to the subject 
are well known in the art. Such methods are used, e.g., for bone marrow transplant, 
peripheral blood stem cell transplant, etc., in patients undergoing chemotherapy. 
[00201] Compositions comprising lentiviral vectors of the invention may provide a 

10 template for a single siRNA or shRNA species, targeted to a single site in a single 
target transcript, or alternatively may provide templates for a plurality of different 
siRNA or shRNA species, targeted to one or more sites in one or more target 
transcripts. In some embodiments of the invention, it will be desirable to utilize 
compositions comprising one or more lentiviral vectors, wherein presence of the 

15 lentiviral vector(s) within a cell or within different cells in the body, results in 

production of a plurality of different siRNA or shRNA species targeted to different 
genes, which may be cellular genes or, where an infection is being treated, genes of an 
infectious organism. Also, some embodiments will provide templates for more than 
one siRNA or shRNA species targeted to a single transcript. To give but one 

20 example, it may be desirable to provide templates for synthesis of one or more RNAs 
that self-hybridize or hybridize with each other to fonn at least one siRNA or shRNA 
targeted to coding regions of a target transcript and at least one siRNA or shRNA 
targeted to the 3' UTR. This strategy may provide extra assurance that products 
encoded by the relevant transcript will not be generated because at least one siRNA or 

25 shRNA will target the transcript for degradation while at least one other inhibits the 
translation of any transcripts that avoid degradation. The invention encompasses 
"therapeutic cocktails", including approaches in which a single lentiviral particle 
provides templates for synthesis of one or more RNAs that self-hybridize or hybridize 
to form shRNAs or siRNAs that inhibit multiple target transcripts. 

30 [00202] It may be desirable to combine the administration of inventive lentiviral 
vectors with one or more additional therapeutic agents. The invention therefore 
encompasses compositions comprising a lentiviral vector of the invention, preferably 
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a lentiviral particle, and a second therapeutic agent, e.g., a composition approved by 
the U.S. Food and Drug Administration. 

[00203] Inventive compositions may be formulated for delivery by any available 
route including, but not limited to parenteral (e.g., intravenous), intradermal, 
5 subcutaneous, oral (e.g., inhalation), transdermal (topical), transmucosal, rectal, and 
vaginal. Preferred routes of delivery include parenteral, transmucosal, rectal, and 
vaginal. Inventive pharmaceutical compositions typically include a lentiviral vector 
in combination with a pharmaceutically acceptable carrier. As used herein the 
language "pharmaceutically acceptable carrier" includes solvents, dispersion media, 

10 coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents, 
and the like, compatible with pharmaceutical administration. Supplementary active 
compounds can also be incorporated into the compositions. 
[00204] A pharmaceutical composition is formulated to be compatible with its 
intended route of administration. Solutions or suspensions used for parenteral, 

15 intradermal, or subcutaneous application can include the following components: a 
sterile diluent such as water for injection, saline solution, fixed oils, polyethylene 
glycols, glycerine, propylene glycol or other synthetic solvents; antibacterial agents 
such as benzyl alcohol or methyl parabens; antioxidants such as ascorbic acid or 
sodium bisulfite; chelating agents such as ethylenediaminetetraacetic acid; buffers 

20 such as acetates, citrates or phosphates and agents for the adjustment of tonicity such 
as sodium chloride or dextrose. pH can be adjusted with acids or bases, such as 
hydrochloric acid or sodium hydroxide. The parenteral preparation can be enclosed 
in ampoules, disposable syringes or multiple dose vials made of glass or plastic. 
[00205] Pharmaceutical compositions suitable for injectable use typically include 

25 sterile aqueous solutions (where water soluble) or dispersions and sterile powders for 
the extemporaneous preparation of sterile injectable solutions or dispersion. For 
intravenous administration, suitable carriers include physiological saline, 
bacteriostatic water, Cremophor EL™ (BASF, Parsippany, NJ) or phosphate buffered 
saline (PBS). In all cases, the composition should be sterile and should be fluid to the 

30 extent that easy syringability exists. Preferred pharmaceutical formulations are stable 
under the conditions of manufacture and storage and must be preserved against the 
contaminating action of microorganisms such as bacteria and fungi. In general, the 
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relevant carrier can be a solvent or dispersion medium containing, for example, water, 
ethanol, polyol (for example, glycerol, propylene glycol, and liquid polyetheylene 
glycol, and the like), and suitable mixtures thereof. The proper fluidity can be 
maintained, for example, by the use of a coating such as lecithin, by the maintenance 
5 of the required particle size in the case of dispersion and by the use of surfactants. 
Prevention of the action of microorganisms can be achieved by various antibacterial 
and antifungal agents, for example, parabens, chlorobutanol, phenol, ascorbic acid, 
thimerosal, and the like. In many cases, it will be preferable to include isotonic 
agents, for example, sugars, polyalcohols such as manitol, sorbitol, sodium chloride in 
10 the composition. Prolonged absorption of the injectable compositions can be brought 
about by including in the composition an agent which delays absorption, for example, 
aluminum monostearate and gelatin. 

[00206] Sterile injectable solutions can be prepared by incorporating the active 
compound in the required amount in an appropriate solvent with one or a combination 

15 of ingredients enumerated above, as required, followed by filtered sterilization. 
Generally, dispersions are prepared by incorporating the active compound into a 
sterile vehicle which contains a basic dispersion medium and the required other 
ingredients from those enumerated above. In the case of sterile powders for the 
preparation of sterile injectable solutions, the preferred methods of preparation are 

20 vacuum drying and freeze-drying which yields a powder of the active ingredient plus 
any additional desired ingredient from a previously sterile-filtered solution thereof. 
[00207] Oral compositions generally include an inert diluent or an edible carrier. 
For the purpose of oral therapeutic administration, the active compound can be 
incorporated with excipients and used in the form of tablets, troches, or capsules, e.g., 

25 gelatin capsules. Oral compositions can also be prepared using a fluid carrier for use 
as a mouthwash. Pharmaceutically compatible binding agents, and/or adjuvant 
materials can be included as part of the composition. The tablets, pills, capsules, 
troches and the like can contain any of the following ingredients, or compounds of a 
similar nature: a binder such as microcrystalline cellulose, gum tragacanth or gelatin; 

30 an excipient such as starch or lactose, a disintegrating agent such as alginic acid, 

Primogel, or corn starch; a lubricant such as magnesium stearate or Sterotes; a glidant 
such as colloidal silicon dioxide; a sweetening agent such as sucrose or saccharin; or a 
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flavoring agent such as peppermint, methyl salicylate, or orange flavoring. 
Formulations for oral delivery may advantageously incorporate agents to improve 
stability within the gastrointestinal tract and/or to enhance absorption. 
[00208] For administration by inhalation, the inventive lentiviral vectors are 
5 preferably delivered in the form of an aerosol spray from pressured container or 

dispenser which contains a suitable propellant, e.g., a gas such as carbon dioxide, or a 
nebulizer. 

[00209] Systemic administration can also be by transmucosal or transdermal 
means. For transmucosal or transdermal administration, penetrants appropriate to the 

10 barrier to be permeated are used in the formulation. Such penetrants are generally 
known in the art, and include, for example, for transmucosal administration, 
detergents, bile salts, and fusidic acid derivatives. Transmucosal administration can 
be accomplished through the use of nasal sprays or suppositories. For transdermal 
administration, the active compounds are formulated into ointments, salves, gels, or 

1 5 creams as generally known in the art. 

[00210] The compounds can also be prepared in the form of suppositories (e.g., 
with conventional suppository bases such as cocoa butter and other glycerides) or 
retention enemas for rectal delivery. 

[00211] In one embodiment, the active agents, i.e., a lentiviral vector of the 
20 invention and/or other agents to be administered together with a lentiviral vector of 
the invention, are prepared with carriers that will protect the compound against rapid 
elimination from the body, such as a controlled release formulation, including 
implants and microencapsulated delivery systems. Biodegradable, biocompatible 
polymers can be used, such as ethylene vinyl acetate, polyanhydrides, polyglycolic 
25 acid, collagen, polyorthoesters, and polylactic acid. Methods for preparation of such 
formulations will be apparent to those skilled in the art. The materials can also be 
obtained commercially from Alza Corporation and Nova Pharmaceuticals, Inc. 
Liposomal suspensions (including liposomes targeted to infected cells with 
monoclonal antibodies to viral antigens) can also be used as pharmaceutically 
30 acceptable carriers. These can be prepared according to methods known to those 
skilled in the art, for example, as described in U.S. Patent No. 4,522,81 1. 
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[00212] It is advantageous to formulate oral or parenteral compositions in dosage 
unit form for ease of administration and uniformity of dosage. Dosage unit form as 
used herein refers to physically discrete units suited as unitary dosages for the subject 
to be treated; each unit containing a predetermined quantity of active compound 
5 calculated to produce the desired therapeutic effect in association with the required 
pharmaceutical carrier. 

[00213] Toxicity and therapeutic efficacy of such compounds can be determined 
by standard pharmaceutical procedures in cell cultures or experimental animals, e.g., 
for determining the LD 50 (the dose lethal to 50% of the population) and the ED 50 (the 

10 dose therapeutically effective in 50% of the population). The dose ratio between 
toxic and therapeutic effects is the therapeutic index and it can be expressed as the 
ratio LD 50 / ED 50 . Compounds which exhibit high therapeutic indices are preferred. 
While compounds that exhibit toxic side effects can be used, care should be taken to 
design a delivery system that targets such compounds to the site of affected tissue in 

15 order to minimize potential damage to uninfected cells and, thereby, reduce side 
effects. 

[00214] The data obtained from cell culture assays and animal studies can be used 
in formulating a range of dosage for use in humans. The dosage of such compounds 
lies preferably within a range of circulating concentrations that include the ED 5 o with 

20 little or no toxicity. The dosage can vary within this range depending upon the 

dosage form employed and the route of administration utilized. For any compound 
used in the method of the invention, the therapeutically effective dose can be 
estimated initially from cell culture assays. A dose can be formulated in animal 
models to achieve a circulating plasma concentration range that includes the IC 50 (i.e., 

25 the concentration of the test compound which achieves a half-maximal inhibition of 
symptoms) as determined in cell culture. Such information can be used to more 
accurately determine useful doses in humans. Levels in plasma can be measured, for 
example, by high performance liquid chromatography. 

[00215] The pharmaceutical composition can be administered at various intervals 
30 and over different periods of time as required, e.g., one time per week for between 

about 1 to 10 weeks, between 2 to 8 weeks, between about 3 to 7 weeks, about 4, 5, or 
6 weeks, etc. For certain conditions such as HIV it may be necessary to administer 
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the therapeutic composition on an indefinite basis to keep the disease under control. 
The skilled artisan will appreciate that certain factors can influence the dosage and 
timing required to effectively treat a subject, including but not limited to the severity 
of the disease or disorder, previous treatments, the general health and/or age of the 
5 subject, and other diseases present. Generally, treatment of a subject with a lenti viral 
vector as described herein, can include a single treatment or, in many cases, can 
include a series of treatments. 

[00216] Exemplary doses for administration of gene therapy vectors are known in 
the art. It is furthermore understood that appropriate doses of a lentiviral vector that 

10 provides a template for synthesis of one or more RNAs that self-hybridize or 

hybridize with each other to form an shRNA or siRNA may, in general, depend upon 
the potency of the siRNA or shRNA and may optionally be tailored to the particular 
recipient, for example, through administration of increasing doses until a preselected 
desired response is achieved. It is understood that the specific dose level for any 

1 5 particular animal subject may depend upon a variety of factors including the activity 
of the specific compound employed, the age, body weight, general health, gender, and 
diet of the subject, the time of administration, the route of administration, the rate of 
excretion, any drug combination, and the degree of expression or activity to be 
modulated. 

20 [00217] Lentiviral gene therapy vectors can be delivered to a subject by, for 

example, intravenous injection, local administration, or by stereotactic injection (see 
e.g., Chen et al (1994) Proc. Natl. Acad. Sci. USA 91:3054-3057). In certain 
embodiments of the invention the vectors may be delivered orally or inhalationally 
and may be encapsulated or otherwise manipulated to protect them from degradation, 

25 enhance uptake into tissues or cells, etc. The pharmaceutical preparation can include 
the lentiviral vector in an acceptable diluent, or can comprise a slow release matrix in 
which the lentiviral vector is imbedded. Alternatively, where the vector can be 
produced intact from recombinant cells, as is the case for retroviral or lentiviral 
vectors as described herein, the pharmaceutical preparation can include one or more 

30 cells which produce the vectors. 

[00218] Inventive pharmaceutical compositions can be included in a container, 
pack, or dispenser together with instructions for administration. 
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Exemplification 

[00219] Example 1: Generation ofpLentiLox Vectors 
5 This example describes generation of the pLentiLox family of vectors. Unless 

otherwise indicated, standard molecular biology techniques were generally performed 
in accordance with guidance found in Current Protocols in Molecular Biology, 
edition as of 2001; or in Sambrook, Russell, and Sambrook, Molecular Cloning: A 
Laboratory Manual, 3 rd ed., Cold Spring Harbor Laboratory Press, Cold Spring 

1 0 Harbor, 200 1 , or according to instructions provided by the manufacturer of the 
relevant reagents or kits. It is noted that a variety of different approaches to 
generating the constructs described below as well as alternative sources for the 
elements incorporated into the constructs may be employed. In particular, the 
sequence information provided herein enables one of ordinary skill in the art to 

15 chemically synthesize part or all of the constructs, thus offering considerable 
flexibility. 

[00220] Characterization of pBFGW. Generation of the pLentiLox vector family 
involved extensive modification of the plasmid pBFGW. Accordingly, the first step 
was a thorough characterization of this plasmid. pBFGW is a third generation 

20 lentiviral plasmid based upon the pCDNA 3 . 1/Zeo plasmid (Invitrogen) that was 

incompletely characterized and lacked sequence information. pBFGW is a member 
of the same vector family as pFUGW 24 but contains a Beta-actin/CMV hybrid 
promoter rather than a ubiquitin promoter. Lentiviral elements in pBFGW are 
derived from HIV-1. We sequenced this plasmid in its entirety. The sequence is 

25 presented as SEQ ID NO: 1. A restriction map of pBFGW was generated based upon 
the sequencing information and verified (for several enzymes) by digestion and 
agarose gel electrophoresis and is shown in Figure 1. 

[00221] pBFGW includes a cassette for the generation of lentivirus inserted 
downstream of the CMV promoter of pCDNA3.1/Zeo. The cassette consists of a 5' 
30 self-inactivating (SIN) LTR, the required packaging sequence (Psi) , the HIV FLAP 
element (FLAP), a hybrid promoter consisting of beta-actin and CMV promoter 
sequences, the open reading frame for enhanced Green Fluorescent Protein (EGFP), 
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the Woodchuck Hepatitis Regulatory Element (WRE), and the 3 ' SIN LTR. There 
existed only three unique restriction sites for the introduction of transgenes and/or 
promoters. In addition pBFGW was 10,441 base pairs in length without the 
introduction of a transgene. Plasmids of this size are more difficult to manipulate 
5 then smaller plasmids. There was also no mechanism to eliminate transgene 
expression after infection. 

[00222] Elimination of elements between FLAP and WRE. pBFGW was 
sequentially digested with Pad and EcoRL The 7 5 930 bp fragment representing the 
backbone was purified by gel purification. The overhanging ends were filled-in by 

10 reaction with Pfu polymerase to generate blunt ends. 

[00223] Introduction of a MCS-LoxP-MCS-LoxP cassette. The plasmid pBluescript 
Lox (pBS-Lox) was created by amplifying two LoxP sites from an unrelated vector 
(pML2MIG) by PGR. The two LoxP PCR products contained a 23bp overlap region 
(containing the restriction sites EcoRl, NotI and Hindlll) at their 5' and 3 5 ends 

15 respectively. These two products were combined in a splicing overlap extension 
(SOE) PCR reaction (Horton, RM, et al, Biotechniques, 8(5):528-35, 1990) using 
standard PCR conditions to create the LoxP-MCS-LoxP cassette, that was cloned into 
the filled-in Hindlll and NotI sites of pBlueScript (KSII+) as a blunt ended fragment. 
The following primers were used in the PCR reactions: 

20 [00224] PCR product #1 was amplified with the following primers: 
[00225] Ll/5': 5'(tggtgggtacctagtggaacc)3' (SEQ ID NO: 32) 
[00226] Ll/3': 5'(aagcttaagcggccgcagaattcgtcgagggacctaataacgtatag)3' (SEQ ID 
NO: 33) 

[00227] PCR product #2 was amplified with the following primers: 
25 [00228] L2/5*: 5 f (gaattctgcggccgcttaagcttggaacccttaatataacttcg)3' (SEQ ID NO: 
34) 

[00229] L2/3': 5'(cgcttcacgagattccagcag)3 f (SEQ ID NO: 35) 

[00230] The LoxP-MCS-LoxP cassette was used in the LentiLox cloning described 
below. The second LoxP site of this cassette contained a three base pair deletion. 
30 This deletion was not identified until later in the construction of the LentiLox vectors 
(see below). 
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[00231] pBS-Lox was digested with Asp71 8. A 209bp fragment was isolated by 
gel purification. This fragment contained a 5' multiple cloning site (5' MCS), a LoxP 
site, a 3' multiple cloning site (3' MCS), and a second LoxP site. The overhanging 
ends of the 209bp Asp718 fragment were filled-in with cloned Pfu polymerase, and 
5 ligated into the 7,930 bp fragment of pBFGW. The orientation of the insertion was 
determined both by restriction fragment length polymorphism and by sequencing. A 
plasmid containing the MC S -LoxP-MC S -LoxP cassette with the expected sequence in 
the correct orientation was named pLentiLox 1.0. 

[00232] Elimination of plasmid backbone restriction sites. Several of the sites 
10 found within the two MCS's were present elsewhere in the pLentiLox 1 .0 vector. 

Specifically, NotI, Apal, and Xhol cut both within the MCS cassette as well as once 
elsewhere in the vector. In order to gain the use of these sites we intended to destroy 
each site within the plasmid backbone. 

[00233] pLLl .0 was partially digested with NotI under conditions of limiting 

1 5 enzyme activity (.0625 Units of enzyme per microgram of pLLl .0 incubated at 37 
degrees Celsius for twenty minutes). The 8,142 bp band that represented linearized 
pLLl .0 was isolated via agarose gel electrophoresis followed by gel extraction. This 
linearized fragment was phosphorylated on its 5' ends with T4 Polynucleotide Kinase 
(PNK). The overhanging ends of this molecule were then filled-in with cloned Pfu to 

20 destroy the NotI site. The ends were ligated together to circularize the molecule. To 
determine whether a NotI site had been destroyed and to determine which NotI site 
had been destroyed the plasmid was digested with NotI and with PstL Destruction of 
the NotI site in the plasmid backbone yielded fragments of 7830 bp and 3 12 bp 
whereas destruction of the MCS NotI site yielded fragments of 7253 and 889 bp 

25 respectively. We accidentally chose a plasmid in which the MCS NotI site was 
destroyed and named it pLentiLox 1 . 1 (pLLl . 1). This was later remedied (see 
below). It should be noted that the creation of pLLl.l was problematic due to 
recombination within the vector resulting in large deletions of required sequences. To 
verify the presence of an intact backbone it was necessary to perform an additional 

30 restriction digest. An enzyme that cut in three places was used, and the digestion 

pattern of pLLl.l was compared with that of pLLl.O to make sure that the two bands 
representing the backbone were the same size. 
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[00234] We designed a strategy to destroy the Apal site that would also eliminate a 
2197 bp fragment between the 3' SIN LTR and the pUC ori that we deemed non- 
essential for lentiviral production. This Apal-Pcil fragment contained a BGH 
polyadenylation site, an SV40 promoter/ori, the Zeomycin resistance gene, and an 
5 SV40 polyadenylation site. We digested pLLl . 1 with Pcil to linearize the plasmid. 
The linearized plasmid was then digested with a limiting amount of Apal (between 
.25 Units and 2 Units per microgram of linearized pLLl.l for twenty minutes at room 
temperature). A 5,945 bp fragment representing a single cut with Apal adjacent to the 
3' LTR was isolated by agarose gel electorphoresis followed by gel purification. The 

10 gel purified fragment was phosphorylated with PNK, filled in with cloned Pfu, and 
circularized by ligation. The ligated DNA was digested with StuI prior to 
transformation into bacteria. Digestion with StuI was expected to specifically cut 
plasmid that contains the 2197 bp fragment that had been eliminated, and thus was 
used to select against contamination with uncut pLLl.l vector. The elimination of 

15 the 2197 bp and the destruction of Pcil and Apal were verified by restriction digest 
and a correct plasmid was identified and named pLentiLox 1 .2 (pLLl .2). 
[00235] To destroy the Xhol site, pLLl .2 was cut with limiting amounts of Xhol 
(.0625 Units per microgram of plasmid for twenty minutes at 37 degrees Celsius). A 
5,947 bp fragment representing single-cut linearized pLL1.2 was isolated via agarose 

20 gel electrophoresis and gel purification. The fragment was 5' phosphorylated with 
PNK, filled in with cloned Pfu, recircularized with ligase, and transformed into 
bacteria. Destruction of the correct Xhol site was verified by restriction digest. A 
correct plasmid was identified and named pLentiLoxl.3 (pLL1.3). 
[00236] Expansion of the 5' MCS. The 5' MCS was intended for the insertion of 

25 promoter sequences, among other purposes. After destruction of the sites mentioned 
above two unique cloning sites (Apal and Xhol) remained in this MCS. We derived a 
list of restriction enzymes that failed to cut pLL1.3 to generate a list of candidate sites 
to engineer into an expanded 5 'MCS. We then designed complementary 
oligonucleotides to allow us to introduce Xbal, Hpal, Nhel, and Pad sites between 

30 the Apal and Xhol sites. The oligonucleotides were designed to include two 
nucleotides between adjacent restriction sites. The sequence of the sense 
oligonucleotides was 5' cgctctagacggttaacgcgctagccgttaattaagcc 3' (SEQ ID NO: 11). 



Page 79 of 171 



WO 2004/022722 



PCT/US2003/028111 



The antisense oligonucleotide was complementary to this sequence but contained an 
additional four nucleotides at the 5' end to produce an Xliol overhang and four 
nucleotides at the 3' end to produce an Apal overhang. The antisense oligonucleotide 
sequence was 5 ? -tcgaggcttaattaacggctagcgcgttaaccgtctagagcgggcc-3' (SEQ ID NO: 
5 12). We chose restriction sites to include based upon the following criteria: (1) 
Inclusion of a site for a restriction enzyme that leaves a blunt end after cutting; (2) 
Inclusion of a restriction site that has an 8 bp recognition sequence. (3) Inclusion of 
sites for which enzymes are widely available. (4) Inclusion of sites for enzymes that 
are known to be reliable cutters. 

1 0 [00237] pLLl .3 was digested sequentially with Apal and Xhol. The digest was 
then purified by Qiaquick PCR purification kit (Qiagen) to eliminate the small DNA 
fragment between Apal and Xhol. The fragment was then treated with Shrimp 
Alkaline Phosphatase (SAP) to eliminate 5 '-phosphate groups. The oligonucleotides 
described above were synthesized, 5 5 phosphorylated, and PAGE-purified by IDT 

15 Corp. (See Web site having URL www.idtna.com. 60 picomols of each oligo were 
annealed in annealing buffer (lOOmM Potassium Acetate, 30mM HEPES-KOH pH 
7.4, 2mM Magnesium acetate) by incubation at 95 degrees for 4 minutes, followed by 
70 degrees for 10 minutes, then slowly cooled (.1 degrees/second) to 4 degrees, then 
maintained at 4 degrees for 10 minutes. The annealed oligos were diluted and ligated 

20 at an equimolar concentration with the linearized pLLl .3 vector. A plasmid 

containing the engineered MCS was identified by restriction digest and named 
pLLl .4. 

[00238] It was at this time that we first realized that we had destroyed the NotI site 
in the MCS rather than in the plasmid backbone (see above). The second NotI site 

25 (adjacent to the LTR) was then destroyed in pLLl .4 as follows. pLLl .4 was digested 
with NotI to linearize the plasmid. The ends were 5 5 phosphorylated with PNK and 
were filled-in with cloned Pfu to blunt and destroy the NotI site. The plasmid was 
recircularized by ligation and transformed into bacteria. We checked for destruction 
of the NotI site by restriction digest and named the resulting plasmid pLentiLox 1.5 

30 (pLLl .5). 

[00239] We next sought to expand the 3' MCS. We designed primers to introduce 
Nsil, SphI, Smal/Xmal, AscI, and BamHI sites between the NotI and EcoRI sites. 
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We followed the same design criterion as described above. An additional criterion 
was that no three consecutive nucleotides would generate a nonsense codon. This 
would allow us to produce fusion proteins in which MCS sequence can remain 
between the fused proteins while minimizing the likelihood of premature termination 
5 of translation. In addition we wanted a minimum of sites in the MCS to be present in 
the sequences encoding EGFP and dsRed2, which we intended to include in many 
derivatives of our vectors (see below). We purposefully intended to make this MCS 
more versatile then the 5' MCS since we anticipated that most applications of our 
vector would require cloning into this MCS. The inclusion of an SphI site was 
1 0 fortuitous. The two nucleotide spacer between the Nsil and Smal/Xmal led to the 

creation of an SphI site that overlaps these other two sites. More fortuitously, SphI is 
a unique site in pLLl .5. The oligonucleotide primers incorporating the desired 
restrictions sites are as follows: 

[00240] 3' MCS Sense: 5' ggccgccgatgcatgccccgggatggcgcgccatggatccgcg 3' 
15 (SEQEDNO: 13) 

[00241] 3' MCS Antisense: 5' aattcgcggatccatggcgcgccatcccggggcatgcatcggc 3' 
(SEQ ID NO: 14) 

[00242] Because we had destroyed the NotI site that should have been present in 
the pLL1.5 3' MCS we had to use a different strategy to insert the oligonucleotides 

20 than was used for the 5' MCS. The pBS-Lox (described above) plasmid was digested 
with NotI and EcoRI enzymes. The small DNA fragment that was liberated was 
eliminated by purifying the linearized pBS-Lox backbone in a Qiaquick PCR 
purification kit. The DNA was SAP treated. The 3' MCS oligonucleotides were 
annealed (see above). 150 fmols of annealed oligonucleotides and cut pLLl.5 were 

25 ligated together and transformed into bacteria. A plasmid containing the expanded 3' 
MCS was identified by restriction digest and named pBS-Lox-MCS. 
[00243] To insert the expanded MCS from pBS-Lox-MCS into pLLl .5, we 
replaced the EcoRI-XhoI fragment from pLLl.5 (containing the improperly destroyed 
NotI site) with the EcoRI-XhoI fragment from pBS-Lox-MCS (containing an intact 

30 NotI site and the expanded 3' MCS). A plasmid containing the expanded MCS and 
intact NotI in the pLL backbone was identified by restriction digest and was named 
pLentiLox2.0 (pLL2.0). 
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[00244] Production of useful pLL2. 0 series vectors 

[00245] We next sought to produce constructs that would be useful starting points 
for many potential uses of our lentiviral vectors. One application in which there is 
enormous interest is the generation of fusion proteins in which a protein of interest (or 
5 a portion thereof) is fused with a fluorescent protein. In particular, EGFP and dsRed2 
are fluorescent proteins that are well characterized and for which sequences are 
widely available. 

[00246] pLL2.1 was engineered to include the EGFP open reading frame. The 
EGFP open reading frame was amplified from pEGFP-Nl (Clontech) to include a 

10 5'NotI site and a 3' Nsil site. The oligonucleotides used were: 

[00247] EGFP/5'NotI: 5'-cggcggccgcgccaccatggtgagcaagggc-3' (SEQ ID NO: 15) 
[00248] EGFP/3'NsiI: 5'-cgatgcatcttgtacagctcgtccatgccg-3' (SEQ ID NO: 16) 
[00249] The PGR product was isolated by agarose gel electrophoresis, gel purified, 
and cloned into the NotI and Nsil sites of pLL2.0 to create pLL2. 1 . 

15 [00250] pLL2.2 was engineered to include the dsRed2 open reading frame. The 

dsRed2 open reading was amplified from pdsRed2-Nl (Clontech) to include a 5 'NotI 
site and 3 'Nsil site. The oligonucleotides used were: 

[00251] dsRed2/5'NotI: 5'-cggcggccgcgccaccatggcctcctccgag-3' (SEQ ID NO: 17) 
[00252] dsRed2/3'NsiI: 5 5 -cgatgcatcaggaacaggtggtggcggccc-3' (SEQ ID NO: 18) 

20 [00253] The PGR product was isolated by agarose gel electorpheresis, gel purified, 
and cloned into the NotI and Nsil sites of pLL2.0 to create pLL2.2. 
[00254] Because the pLentiLox series has a self-inactivating 5' LTR, the pro virus 
has no endogenous 5' promoter activity. Therefore, it is necessary to include an 
internal promoter to drive transgene expression. This makes the system compatible 

25 with tissue-specific promoters. We chose to clone a ubiquitous and constitutive 

promoter into our vector to create a transgenic system that should be active in most 
eukaryotic cell types as well as in all the tissues of a mouse. The promoter we chose 
was the Ubiquitin C promoter (UbC). We first attempted to clone UbC by PGR 
amplification of the promoter from the pUB6/V5/His vector. However, the UbC 

30 sequence was not robustly amplified via PCR (in several attempts). As a second 

strategy we digested the pUB6/V 5/His vector with Bglll and Hindlll which generates 
a fragment containing the UbC promoter. This fragment was isolated by agarose gel 
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electrophoresis and gel extraction. The fragment was filled-in with cloned Pfu 
polymerase and 5' phosphorylated with PNK. This fragment was ligated into Hpal 
digested pLL2.0, pLL2.1, and pLL2.2 to generate pLL2.3, pLL2.4, and pLL2.5 
respectively. These plasmids were verified by restriction digest to contain the proper 
5 insert. pLL2.4 and pLL2.5 were transfected into 293.T cells and production of the 
correct fluorescent protein was verified by visualizing the transfected cells under an 
epifluorescent microscope 24 hours after transfection. 

[00255] As described in the following example, we later discovered that one of the 
loxP sites in the pLL2.0 series of vectors contained a deletion that rendered it 
10 unusable. Jh order to generate the pLLS.O series (which contain two wild type loxP 
sites) from the pLL2.1-pLL2.7 series we cloned Apal-EcoRI inserts of various sizes 
(2.1-917 bp, 2.2-875bp, 2.3-1417bp, 2.4-2,138bp, 2.5-2096bp, 2.6-1519bp, and 2.7- 
1819bp) from the pLL2.1-pLL2.7 vectors into the 5,83 lbp Apal-EcoRI backbone 
from pLL3.0 to create pLL3.1-pLL3.7. 

15 

[00256] Example 2: Generation ofLentiviral Vectors for RNAi 
[00257] Modification of pLL2. 0 for use in RNAi In order to drive expression of an 
RNAi-inducing stem-loop, we decided to incorporate a polIII promoter into a 
pLentilox series vector. In addition, we decided to incorporate a polll promoter to 
20 drive expressin of EGFP as a reporter. Because we were concerned that the 

placement of a strong polll promoter near a polin promoter might interfere with 
polHI function we chose to place the polII-EGFP cassette between LoxP sites. This 
would allow us to eliminate the polll promoter if we were failing to accumulate the 
stem-loop RNA. 

25 [00258] We first inserted a cassette to drive expression of EGFP. ADNA 

fragment containing the CMV promoter upstream of the EGFP open reading frame 
was amplified from pEGFP-Cl. The oligonucleotides were selected to engineer a 5' 
NotI site and 3' EcoRI site. The oligonucleotides used were: 
[00259] 5'CMV/NotI: 5'-cggcggccgcgtggataaccgtattaccgccatg-3 ? (SEQ ID NO: 

30 19) 

[00260] 3'EGFP/stop/EcoRI: 5' cggaattcctacttgtacagctcgtccatgccgag-3 5 (SEQ ID 
NO: 20) The PCR product was isolated by agarose gel electrophoresis and purified by 
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gel extraction. The fragment was cloned into the NotI and EcoRI sites of pLL2.0 to 
create pLL2.6. This plasmid was tested by restriction digest and production of EGFP 
was verified by transfection of pLL2.6 into 293. T cells. 

[00261] The insertion of the U6 promoter presented an additional challenge. We 
5 needed to introduce a cloning site for the introduction of RNAi sequences. The U6 
promoter includes sequences required for activity up until the +1 transcriptional start 
site. Therefore, one cannot modify the sequences prior to -1 without incapacitating 
U6. We did not want to introduce a site after +1 as that would dictate that the first 
several nucleotides of the stem-loop would be derived from the restriction site. We 

10 therefore engineered the U6 promoter to introduce an Hpal site that cuts at the -1 

position of U6. The first three nucleotides of the Hpal site are present in the wildtype 
U6. We had to alter the nucleotides at -1 to +2 in order to engineer an Hpal site. As 
a result the U6 promoter is not functional when containing the Hpal site. However 
after digestion with Hpal and introduction of oligonucleotides to code for a stem- 

15 loop, those oligonucleotides can re-generate the wild-type 3 5 end of the U6 promoter 
thereby restoring transcriptional activity. We engineered oligonucleotides to add a 
Xbal site to the 5' end of the U6 promoter and Hpal, BstEII, and Xhol sites to the 3' 
end. We cloned the amplified PGR product from the pmU6 plasmid and introduced 
the product into the Xbal and Xhol sites of pLL2.6. The oligonucleotides used were: 

20 [00262] 5' Xbal/U6: 5'-gctctagagatccgacgccgccatctctag-3' (SEQ ID NO: 21) 
[00263] 3' XhoI/BstEII/HpaI/U6: 5'- 

gcctcgagggtcaccgcgcgttaacaaggcttttctccaaggg-3 5 (SEQ ID NO: 22) 

[00264] The resulting plasmid was verified by both restriction digest and by 

sequencing and was named pLL2.7. 

25 [00265] Repair of LoxP and engineering of new restriction site 

[00266] It was at this point that we recognized that the 3 ' LoxP site in the original 
pBS-Lox contained a three nucleotide deletion that rendered it unusable. We decided 
to fix the 3' LoxP site in the pLL2.0 plasmid and then use this plasmid backbone to 
clone in sequences from pLL2.1-pLL2.7. The repair of the LoxP site gave us an 

30 opportunity to engineer a new restriction site outside of the LoxP site (between the 3 5 
LoxP and the WRE). This site would give our plasmid series even greater flexibility 
for engineering other additions such as IRES-GFP, or inducible expression systems. 
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[00267] Oligonucleotides were designed to amplify a fragment from pLL2.0 from 
the EcoRI site to a PflMI site located within WRE. The 5' oligonucleotide extended 
from the EcoRI site in the 3 ? MCS through the mutant LoxP site and into the region 
between the LoxP and the WRE. This oligonucleotide was designed to add the 
5 deleted nucleotides to the LoxP site and to create a Pcil site immediately following 
the LoxP site. The amplified DNA was inserted into the pLL2.0 backbone digested 
with EcoRI and PflMI. The oligonucleotides used were: 
[00268] 5 5 EcoRI/LoxFix/PciI: 

5'-gcgaattcgtcgagggacctaataacttcgtatagcatacattatacgaagttatacatgtttaagggttccgg-3' 
10 (SEQIDNO:23) 

[00269] 3' PflMl/Rev: 5'-aaggagctgacaggtggtggcaatg-3' (SEQ ID NO: 24) 
[00270] A plasmid was checked by sequencing for the addition of a correct LoxP 
and Pcil sites and named pLL3.0. 

[00271] In order to generate the pLL3.1-pLL3.7 series from the pLL2. l-pLL2.7 
15 series we cloned Apal-EcoRI inserts of various sizes (2.1-917bp, 2.2-875bp, 2.3- 
1417bp, 2.4-2,138bp, 2.5-2096bp, 2.6-1519bp, and 2.7-1 819bp) from the pLL2.1- 
pLL2.7 vectors into the 5,83 lbp Apal-EcoRI backbone from pLL3.0 to create 
pLL3 . 1 -pLL3 .7. All plasmids were verified by restriction digest. 

20 [00272] Example 3: Specific Silencing of Genes in T Cells using a Lentiviral 
Vector 

[00273] Materials and Methods 

[00274] Cell culture: E10 and primary mouse splenocyte cultures were performed 
as previously described (1 1). 293T cells (human fibroblasts) were cultured as 
25 described (21). In vitro T-cell proliferation was performed on 200,000 activated T- 
cells cultured in the presence/absense of increasing doses of IL2 (0 to 100 ng/ml) and 
pulsed for 6 h with [ 3 H]TdR to assay proliferation. 

[00275] Oligonucleotide design. The following approach was used to design 
oligonucleotides suitable for cloning into pLL3.7 vectors to generate vectors capable 
30 of directing synthesis of shRNAs for gene silencing in this and the following 
examples. As described above, we have engineered a multiple cloning site 
immediately following the U6 promoter. An Hpal site leaves a blunt end prior to the - 
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1 position in the promoter. The oligonucleotide design must incorporate a 5' T in 
order to reconstitute the -1 nucleotide of U6. An Xhol site cuts downstream of the U6 
start site. The following oligonucleotide format was used: 

[00276] Sense oligonucleotide: 5 'T-(GN1 8)-(TTCAAGAGA)-(8 1NC)-TTTTT 
5 [00277] Antisense oligonucleotide: Complement of sense but with additional 
nucleotides at 5' end to generate Xhol overhang. 

The loop sequence (TTCAAGAGA) (SEQ ID NO: 10) is based upon Brummelkamp 
et al. (Science 2002). 

10 

Oligonuclotides with 5' phosphates and PAGE purified were ordered from Integrated 
DNA Technologies (IDT), Coralville, LA. 

[00278] Generation of lentiviral transfer plasmids containing shRNAs targeted to 
15 CD8. 

[00279] Oligonucleotides having the following sequences were inserted into 
pLL3.7 to produce lentiviral transfer plasmids capable of directing expression of an 
shRNA targeted to CD8. 

[00280] CD8 sense: 5'- tgctacaactactacatgacttcaagagagtcatgtagtagttgtagcttttttg -3 5 
20 (SEQ ID NO: 36) 

[00281] CD8 antisense: 5 5 - 

gttacaaaaaagctacaactactacatgactctcttgaagtcatgtagtagttgtagca-3 5 (SEQ ID NO: 37) 
[00282] The following protocol was used to clone oligonucleotides into pLL3.7 in 
this and the following examples: 
25 [00283] Oligos are resuspended in water at 60pmolA, and annealed as follows. 
Annealing oligos: 
IX Sense oligo 
IX Antisense oligo 
48X Annealing Buffer 

30 

Annealing Buffer Recipe: 

lOOmMK-acetate 

30mM HEPES-KOH pH 7.4 
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2mM Mg-acetate 

Incubate at 95° 4min 
70° lOmin 

5 Decrease temperature to 4° slowly (. 1 °C/min) 
Incubate at 4° 10 min 

pLentiLox 3.7 is digested as follows: 
Digest l-2jLLg with Xhol and Hpal 
1 0 Treat with SAP or with CIP 
Purify linearized fragment 
Estimate concentration 

Ligation is performed as follows: 
15 Ligate linearized product and annealed oligos at equimolar concentration. I typically 
use 60fmol of each component in a final concentration of lOjaL. 

Transformation is performed according to standard techniques. The use of an endA" 
strain of E. coli, e.g., STBL-2 cells is strongly recommended. 

20 

Clones are tested for the presence of inserts as follows: 

We have had success testing for insertion of the stem-loop sequence with both colony 
per or by restriction digest. Insertion of insert causes a band shift of ~60bp in an 
Xbal/NotI fragment when compared to parental vector. This can be seen by 2% 
25 agarose gel electrophoresis. RPML Cell viability immediately after electroporation 
was typically around 60%. 

[00284] Electroporation: For electroporations, 10 jug of LentiLox plasmid were 
added to prechilled 0.4-cm electrode gap cuvettes (Bio-Rad, Hercules, CA). E10 cells 
(1.5 X 10 7 ) were resuspended to 3 x 10 7 cells/ml in cold serum-free RPMI, added to 
30 the cuvettes, mixed, and pulsed once at 300 mV, 975 |iF with a Gene Pulser 

electroporator II (Bio-Rad). After electroporation, the cells were put into four wells of 
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a 24-well plate, each containing 1 ml of RPMI. Cell viability immediately after 
electroporation was typically around 60%. 

[00285] Flow cytometry: For flow cytometric analysis described in this and 
subsequent examples, all cells were washed once in FACS buffer (PBS supplemented 
5 with 2% FCS and 0.01% sodium azide), resuspended to 200 jul, and stained directly 
with the appropriate antibodies. The stained cells were washed once, then 
resuspended in 100 fil FACS buffer containing 5 jxg/ml propidium iodide (PI). 
Unstained and singly stained controls were included in every experiment. Cell data 
were collected on a FACSCalibur flow cytometer (BD Biosciences, San Jose, CA) 
10 and four-color analyses (GFP, PE, PI, and allophycocyanin) were done with 

CellQuest software (BD Biosciences). All data were collected by analyses performed 
on at least 2.5 x 10 5 Pi-negative events (viable cells). 

[00286] The following phycoerythrin (PE) conjugated antibodies were used in this 
and the following examples: anti-CD4 (clone RM4-5), anti~CD8oc (clone 53-6.7), 
15 anti-CD25 (clone PC81), anti TCRp (clone H57-597), anti-CD28 (clone 37.51) and 
strepavidin. Allophycocyanin (APC)-conjugated anti-CD8a and biotin-conjugated 
anti-CD3 were also used for analysis. All antibodies were from BD Pharmingen (San 
Diego, California). All plots shown are gated for viable cells, which were isolated by 
selecting PI" cells. 

20 [00287] Northern blot analysis: For Northern blot analysis, cells were lysed with 
Trizol reagent (Invitrogen), and total cellular RNA was prepared according to the 
manufacturer's instructions. CD4/CD8 probe hybridization and was performed as 
described (1 1). For the small RNA Northern, total RNA (60 |ixg) was fractionated on a 
10% denaturing polyacrylamide gel and transferred to nylon membrane. The 

25 membrane was hybridized to a probe consisting of a 21nt CD8 siRNA sense strand 5' 
end-labeled with 32 P. A 5' radiolabeled oligonucleotide probe to 5S RNA was used 
to determine equal loading of RNA. The probe for the siRNA CD8 was taken exactly 
from the sense strand in the plL3.7 CD8 stem-loop as described in reference 1 1 . 
[00288] Results 

30 [00289] To determine whether lentiviral vectors could deliver shRNAs and silence 
genes in mammalian cells, the pLL3.7 vector described in Example 2 that carries the 
U6 RNA polymerase III (polIII) promoter (Fig. 17A) was used. This promoter is 
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known to efficiently transcribe small RNAs that silence gene expression (2, 26). The 
pLL3.7 vector was also engineered to express EGFP as a reporter gene, permitting 
infected cells to be tracked by flow cytometry. EGFP expression was driven by a 
constitutive RNA polymerase II (polll) promoter derived from cytomegalovirus 
5 (CMV) (Fig. 17B). This promoter is active in most mammalian tissues (27). The 
CMV promoter of pLL3.7 was placed between LoxP sites to allow removal of this 
genetic element if transcript levels from the U6 promoter were not sufficient to 
silence gene expression due, for example, to possible promoter interference that might 
decrease expression of shRNAs in infected cells. 
10 [00290] To test whether pLL3.7 could be used to silence gene expression in 
mammalian cells, the sequence for a shRNA predicted to target the T cell surface 
molecule, CD8, was introduced into this vector to generate pLL3.7 CD8 (Fig. 17B). 
The CD8 shRNA duplex sequence (SEQ ID NO: 25: 5'- 

TGCTAC AACTACTAC ATGAC-3 ' when expressed in DNA format) was based on 

1 5 sequences that we had previously characterized in the CD8 + El 0 thymoma cell line 
(1 1, 28), and that we had shown will specifically downregulate CD8 in these cells 
when introduced as siRNAs. As a first test we electroporated pLL3.7 CD8 or a 
pLL3.7 vector containing a stem loop targeted to an unrelated sequence (CD25T, a 
stem loop targeted to CD25 but containing a mutation resulting in an early 

20 termination site) into E10 cells, and quantified expression of CD8 in transfected cells 
by flow cytometry. E10 cells that took up pLL3.7 CD8 or pLL3.7 CD25T DNA 
could be identified by flow cytometry based on their expression of GFP, i.e., cells that 
took up the vector became GFP-positive. As shown in Figure 23, GFP + E10 cells 
transfected with pLL3.7 CD8 (lower panel) showed on average a 7-fold reduction in 

25 CD8 levels relative to cells transfected with pLL3.7 CD25T (middle panel). This 
result demonstrated that pLL3.7 CD8 was able to silence expression of CD8 in T 
cells. Since we could detect GFP + and CD8-silenced cells, promoter interference did 
not present a major barrier for co-expression of shRNAs and a reporter gene. 

An aspect of the data that should be noted is that although Figure 23 appears 

30 to suggest that higher levels of GFP correlates with decreased CD8 

expression, which would suggest that more copies of the lentiviral DNA result in 
greater silencing, this is an artifact due to the fact that when subtracting signal to 
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correct for overlap between signals, more signal from the PE (CD8) channel was 
subtracted from the GFP channel than should have been, which makes the cells with 
high GFP appear to have less CD8. In reality, it appears that any expression from the 
U6 promotor, regardless of number of integrants or copies leads to full silencing. A 
5 further complication of this experiment is that GFP expression from the plasmid is 
actually highest after 24 hrs. As a consequence, at 48 hours, there are many silenced 
cells that have become GFP". 



[00291] Example 4: Production of Infectious Lentiviral Particles using Lentiviral 
1 0 Transfer Plasmids Containing shRNA Sequences 
[00292] Materials and Methods 

[00293] Cell culture and lentivirus production. Cell culture was performed as in 
Example 3. Lentiviral production was performed as described (24) using packaging 
plasmids pMDL g/pRRE, pCMV VSV-G, and pRSV-REV, described in references 21 
15 and 40. 

[00294] Harvesting and titering lentivirus. Lentivirus was harvested and titered 

according to the following protocol: 

Harvesting: 

1 . Harvest supernatant from cells and spin at 25,000 rpm for 1.5hrs 
20 2. Remove all liquid, add x volume of PBS (between 15 and 200ul), and allow to sit 
overnight at 4 degrees 

3. Pipette up and down -20 times 

4. Use or aliquot and flash-freeze in liquid N2, store at -80. 
Titering: 

25 

1. Plate 4xl0 5 293.T cells/well in a 6-well plate 12-24 hours prior to titering. It is 
helpful to have an additional well as a negative control that you mock infect with 
D10+polybrene but without virus. 

2. Make a stock solution of D10 medium with 8|ag/ml polybrene. 

30 3. Generate a 10-fold dilution series of virus in the DIO+polybrene. Using 

1.5mls/well you should have ljul, .1, .01, .001, .0001, and .00001|llL of virus/well. 
4. Incubate at 37 degrees O/N. Replace media with fresh D10. 
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5. At least 48 hours after infection trypsinize cells for FACS analysis. (Trypsinize, 
inactivate with media, spin, and resuspend in cold PBS). 

6. FACS analyze for EGFP expression and record the percentage of cells that are 
EGFP positive. 

5 7. Use a well that has between .1% and 10% of cells expressing EGFP to determine 
titer. 

Sample calculation assuming 1% infection from the well with .Oljul of virus: .01 
(percentage of cells that are EGFP positive) x4x 10 5 = 4x 10 3 positive cells. 
4 x 10 3 x 100 (dilution factor) - 4 x 10 5 viral particles/ul. 

10 

In general at least 5 x 10 5 viral particles/ul should be used for embryo infections. 

[00295] Transfection of 293 cells. The following protocol was used for 
transfection of 293 cells in this and the following examples. 

15 

1. Plate 12 x 10 6 293.T cells in 20 ml on a 15 cm 2 plate 24 hours before 
transfection. In general, two 15 cm plates per virus are used. It is highly 
preferred that the cells be well-maintained and of relatively low passage 
number. 

20 2. Mix the following DNAs (preferably made using Endo-free Qiagen Kits 

according to the manufacturer's instructions) in a FACS tube. The DNAs 
should be in Endo-free TE at a concentration of 0.5|iig/|Lil. 
For 3 plasmid system: 

20 jag vector (transfer plasmid, e.g., pLL3.7 CD8) 
25 1 0 ^g p VS VG (envelope plasmid) 

15 jug A 8.9 (packaging plasmid) 
For 4 plasmid, system (recommended), 

20 jug vector, (transfer plasmid, e.g., pLL3.7 CD8) 
lOjagpVSVG (envelope plasmid) 
30 10 jug RSV-REV (plasmid supplies Rev protein) 

10 ]ig pMDL g/p RRE (packaging plasmid) 
The envelope plasmid, packaging plasmids, and Rev-supplying plasmid are 
described in further detail in references 21, 24, 40, and 41. 
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Add 400 |Ltl 1.25 M CaCl 2 and 1.5 ml H 2 0 and mix by tapping gently. 
The following steps are done 1 plate at a time. 

3. Add 2 ml of 2X HBS dropwise to DNA mixture while bubbling with a Pasteur 
5 pipette. When finished, continue to bubble for 12-15 seconds. 

4. Take plate of 293T out of the incubator (plate remains in incubator for long as 
possible), and add transfection mixture dropwise all over the plate. Gently 
swirl plate from front to back, and return immediately to incubator. 

5. 3.5 to 4 hours later, remove media, wash 2x with 10ml warm PBS, and add 20 
10 ml warm D10 onto plate and place in incubator. 

6. 36-48 hours after transfection, harvest viral supernatant and spin at 2000 rpm, 
7 min at 4°C in a 50ml tube. 

7. Filter viral SN through .45 urn filter. Add 35ml of filtered supernatant to an 
ultracentrifuge tube. Balance tubes with additional media. Cover tubes with 

15 small piece of parafilm. (It is useful to titer some of the leftover supernatant to 

determine if there is loss of virus during concentration.) 

8. Spin tubes using a SW-28 rotor at 25,000 rpm, 90 min, 4°C. Decant liquid 
and leave tube upside down on kimwipe for 10 min. Aspirate remaining 
media being careful not to touch bottom of tube. 

20 9. Add 15|iil cold PBS (for embryo infections, or any volume you wish) and 

leave tube at 4°C O/N with no shaking. 

10. To resuspend, hold tube at angle and pipet fluid over pellet 20 times, being 
careful not to touch pellet with tip. It is expected that the pellet not be 
resuspended after this is complete. This pellet does not contain virus and can 

25 be discarded. 

1 1 . Aliquot or use virus. Virus should be aliquoted, flash-frozen in liquid nitrogen 
and stored at -80. There should be no change in titer with freezing 
concentrated virus. Avoid multiple freeze-thaws. 

[00296] Results 

30 [00297] To test whether pLL3.7 vectors could be used to generate infectious 
lentivirus particles, 293T cells were transfected with pLL3.7 or pLL3.7 CD8, and 
three lentiviral packaging plasmids developed by Miyoshi et al. (29). Our initial 
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concern was that a lentiviral construct containing shRNA sequences would itself be 
susceptible to RNA interference, thus preventing generation of viral (RNA) genomes 
and production of infectious viral particles. We failed to find any evidence of this 
type of auto-inhibition. We were able to generate viral stocks of pLL3.7 and pLL3.7 
5 CD8 that could infect mouse fibroblast 3T3 cells, as gauged by GFP expression, and 
the titre of these viral stocks was always qualitatively similar. These results 
demonstrated that lenti viruses capable of mediating gene silencing can be generated 
efficiently. In other words, shRNAs generated by lentiviral vectors can target 
endogenous cellular transcripts but do not inhibit production of viral RNAs carrying 
10 the same sequences. 

[00298] Example 5: Stable Silencing of Genes and Production of Processed 
shRNAs in T Cells by a Lentiviral Vector 
[00299] Materials and Methods 

15 [00300] Cell culture, lentivirus production, and lentivirus infection. Cell culture 
was performed as in Example 2. Lentiviral production and infection were performed 
as described (24) in this and following examples unless otherwise indicated. For 
some experiments, sorted populations of infected E10 cells were maintained in long- 
term culture. E10 cells pLL3.7 CD8 (CD 8 RNAi virus) were sorted four days after 

20 infection for GFP expression and low CD 8 expression, while cells infected with 

control virus were sorted for GFP expression only. Each population was cultured for 1 
month and analyzed for CD 8 expression via flow cytometry at weekly intervals. 
[00301] Results 

[00302] We examined whether the LentiLox system could be used to silence gene 
25 expression upon infection of mammalian cells. To accomplish this, E10 cells were 

infected with pLL3.7 and pLL3.7 CD8 viruses. A low viral titre was used so that only 
a fraction of cells became infected, as gauged by GFP expression (Fig. 18 A). This 
allowed us to follow the fate of both infected and non-infected cells simultaneously. 
Infected (GFP*) cells on average showed a 16-fold reduction of CD8 expression. 
30 Figure 2a shows density plots indicating the expression levels of CD4 and CD 8 48 
hours post-infection. 
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[00303] Expression of CD8 and of other surface proteins was measured at 48 hours 
and at various times thereafter. Inhibition of CD8 expression in E10 cells infected 
with pLL3.7 CD8 was specific since levels of other surface proteins were not altered 
(Fig. 18B). Furthermore, in a subline of E10 cells engineered to express human CD8, 
5 which differs from mouse CD8 by 4 out of 19 nucleotides in the region that we 
targeted, we showed that pLL3.7 CDS selectively reduced expression of the mouse 
protein. Figure 24 shows expression of human CD8 in 3 populations of cells that 
either were (lower panels) or were not (upper panels) transfected with a construct 
encoding human CD8 (hCD8). The leftmost panels show expression of human CD8 

10 in wild type ES cells, illustrating expression of hCD8 in transfected cells (lower left 
panel; cells below bar display hCD8 expression. The middle panels show expression 
of human CD8 in a population of ES cells that were infected with pLL3.7 CD8 and 
displayed effective silencing of mouse CD8 (low CD8). As shown in the lower 
middle panel, this population of cells did not display silencing of human CD 8. The 

15 rightmost panels show expression of human CD8 in a population of ES cells that were 
infected with pLL3.7 CD8 and did not display extensive silencing of mouse CD8 
(high CD8). As shown in the lower right panel, this population of cells also did not 
display silencing of human CD8. These data show that an shRNA targeted to mouse 
CD8 does not silence human CD8, confirming the specificity of silencing. Cells 

20 infected with a control virus (pLL3.7) or a virus that expressed a neuron-specific 
shRNA (pLL3.7 Mena+) showed no decrease in CD8 levels (Fig. 18A and data not 
shown). 

[00304] To confirm that the decrease in surface CD8 expression seen in E10 cells 
resulted from mRNA degradation, we assayed CDS mRNA levels in sorted (GFP + ) 

25 cell populations infected either with a control virus (pLL3.7) or CD8 RNAi virus 
(pLL3.7 CD8) (Fig. 19 A). Consistent with results showing significant reduction of 
CD8 protein levels, the amount of CD8 transcripts in E10 cells infected with pLL3.7 
CD8 was 13 -fold lower then in controls (Fig. 19B). The same cells expressed normal 
amounts of CD4 transcripts (Fig. 19B), confirming the specificity of the RNAi 

30 knockdown. We also examined whether the shRNAs encoded by pLL3.7 CD8 were 
processed into the approximately 21 nucleotide-long RNAs reported to mediate RNAi 
(12) by blotting cellular RNA extracts with a probe directed against the anti-sense 
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strand of the CD 8 stem loop. Only the pLL3.7 CD8 infected cells produced CD 8 
siRNAs (Fig. 19C). The dominant population of siRNAs detected was 21 bases in 
length; although 1-2 bp longer species of siRNAs were also present (Fig. 19C). No 
precursor shRNA was visible on the autoradiogram. 
5 [00305] To test the stability of lentivirus-mediated RNAi in mammalian cells, we 
followed expression of CD8 in long-term cultures of E10 cells infected with pLL3.7 
or pLL3.7 CD8. Cells were sorted based on GFP and CD8 expression levels four days 
after infection with lentivirus, and subsequently monitored for expression of these 
proteins weekly. No change in expression of CD8 was observed and these cells 

10 remained uniformly GFP positive, demonstrating that RNAi mediated by the 

integrated lentivirus was stable (Fig. 19A). In each experiment a small fraction (2 to 
15%) of E10 cells infected with pLL3.7 CD8 showed no evidence of gene silencing, 
maintaining wild type CD8 expression (Fig. 19A). This was not necessarily the result 
of a low copy number of integrated viruses or poor expression of viral genes since 

15 some of these cells expressed very high levels of GFP (Fig. 19A). As shown in the 
Northern blot in Figure 25, we were able to determine that these cells expressed little, 
if any shRNAs directed against CD8, suggesting that the activity of the polIII 
promoter was reduced. 

[00306] Example 6: Functional Gene Silencing in Differentiated Mammalian Cells 
20 Induced by Lentiviruses 

[00307] Materials and Methods 

[00308] T-cell purification and stimulation. Cells were harvested from spleen and 
lymph nodes. They were plated in RPMI with 10% FBS supplemented with 1 ug/ml 
ova peptide. Cells were infected 24 and 48 hours after plating and analyzed 72 hours 

25 after plating. This activation method yields >90% purity of T-cells. 

[00309] Viral infection. Spin infection was performed as described for retrovirus 
in van Parijs, L., et al. 9 Immunity, 11:281, 1999 using 50 ul of concentrated lentivirus. 
[00310] We tested whether the LentiLox-based RNAi system could be used to 
silence gene expression in primary mammalian cells. In these experiments, we 

30 purified CD8 + T cells from the spleens of OTI T-cell receptor (TCR) transgenic mice, 
activated them with cognate peptide antigen, and then infected these cells with 
pLL3.7 or pLL3.7 CD8. After three days in culture, the T cells were harvested and 
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analysed for GFP and CD8 expression by flow cytometry. Between 68 and 82% of 
the cells were infected, as gauged by GFP expression (Fig. 4a). The infected (GFP + ) 
population reproducibly showed approximately a 14-fold reduction in CD8 
expression, demonstrating that lentivirus-driven expression of shRNAs efficiently 
5 silenced gene expression in primary T cells (Fig. 20 A). This effect of pLL3.7 CD 8 
was specific since infected cells showed normal expression of other T cell surface 
markers (Fig. 20A). 

[00311] We next examined whether the degree of gene silencing achieved in 
primary mammalian cells using the LentiLox system was functionally relevant. To 

10 accomplish this we performed an experiment aimed at studying the biological effects 
of targeting the IL-2 receptor (IL-2R) in T cells using lentivirus-mediated RNAi. IL-2 
is an important growth factor for T cells, and T cells derived from mice that lack the 
receptor for this cytokine fail to proliferate in vitro (31). To determine whether we 
could phenocopy IL-2R-deficiency in primary T cells using lentivirus-mediated 

15 RNAi, we designed a shRNA against the alpha chain of the IL-2 receptor (CD25) and 
used this sequence to create pLL3.7 CD25. The shRNA sequences were as follows: 
[00312] CD25 sense: 5'- tgcattcacctaatcggctgttcaagagacagccgattaggtgaatgcttttttg- 
3 5 (SEQIDNO: 38) 
[00313] CD25 antisense: 

20 5'-gtcaccaaaaaagcattcacctaatcggctgtctcttgaacagccgattaggtgaatgca-3 5 (SEQ ID NO: 
39) 

[00314] In most experiments between 70 and 85% of activated CD8+ TCR 
transgenic T cells were infected with this virus (Fig. 20 A). Infected cells on average 
showed a 25-fold reduction in IL-2Ra chain expression, but expressed normal levels 
25 of other surface markers (Fig. 20A). These cells were challenged with increasing 
concentrations of IL-2, resulting in a 4- to 5-fold reduction in the response to this 
cytokine (Figure 20B). Therefore, the LentiLox RNAi system can be used to 
phenocopy loss-of-function in primary T cells. 

30 [00315] Example 7: Functional Silencing of Genes in Embryonic Stem Cell- 
derived Mice by a Lentiviral Vector 
[00316] Materials and Methods 
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[00317] Generation of lentiviral transfer plasmids containing shRNAs targeted to 
Mena+, Beta-catenin, and p53. Oligonucleotides having the following sequences 
were inserted into pLL3.7 as described above to produce lentiviral transfer plasmids 
capable of directing expression of shRNAs targeted to Mena+, Beta-catenin, or p53 
5 transcripts. 

[00318] Mena+ sense: 5 ? - 

tgtcctgtgcctggcctactttcaagagaagtaggccaggcacaggactttttggaaac-3 5 (SEQ ID NO: 26) 
[00319] Mena+ antisense: 

[00320] 5 '-tcgagttcccaaaaagtcctgtgcctggcctacttctcttgaaagtaggccaggcacaggaca-3 5 
10 (SEQ ID NO: 27) 

[00321] Beta-catenin sense: 

[00322] 5 5 - tgtccagcgcttggctgaacttcaagagtgttcagccaagcgctggactttttggaaa-3 5 (SEQ 
ID NO: 28) 

[00323] Beta antisense: 
1 5 [00324] 5 5 - tcgatttccaaaaagtccagcgcttggctgaacactcttgaagttcagccaagcgctggaca-3 ' 
(SEQ ID NO: 29) 
[00325] P53 sense: 
[00326] 5 5 - 

tggtctaagtggagcccttcgagtgttagaagcttgtgacactcggagggcttcacttgggcctttttggaaa-3' (SEQ 
20 ID NO: 30) 

[00327] P53 antisense: 
[00328] 5' - 

tcgatttccaaaaaggcccaagtgaagccctccgagtgtcacaagcttctaacactcgaagggctccacttagacca - 3 ' 
(SEQ ID NO: 31) 

25 [00329] ES Cells: AK7 ES cells were maintained and infected as described (23). 
Clones of ES cells were picked, expanded, and analyzed by flow cytometry for GFP 
expression. If the clone contained a mixed population of infected and uninfected 
cells, the GFP population was purified by fluorescence activated cell sorting. 
[00330] Production of transgenic mice: Transgenic mice were generated 

30 essentially as described in reference 24. 
[00331] Results 
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[00332] A unique feature of lentivirus-based vectors is that they can stably express 
transgenes in stem cells and are not silenced during development, allowing for the 
efficient generation of transgenic mice (23, 24). We tested whether the LentiLox 
RNAi system could be used to silence gene expression in stem cells, as well as 
5 animals generated from these cells. To accomplish this we infected embryonic stem 
cells with versions of the pLL3.7 vector that expressed shRNAs against CD8 (pLL3.7 
CD8), Mena+ (pLL3.7 Mena+), or p53 (pLL3.7 p53). 

[00333] We found that these vectors could efficiently infect embryonic stem cells, 
and we are able to generate and maintain stable lines of infected ES cells (Figure 21 A, 

1 0 and data not shown). 

[00334] To test whether ES cells infected with RNAi lentivirus were capable of 
giving rise to progeny that showed gene silencing, we generated uniformly GFP+ ES 
cell populations infected with pLL3.7 CD8, pLL3.7 Mena+, or the empty vector, 
pLL3.7, by cell sorting. Ten to twelve of these cells were injected into day 3 

1 5 blastocysts, which were subsequently implanted into pseudopregnant recipients. To 
ensure that the lentivirus-infected ES cells contributed to immune tissues in the 
chimeric offspring, we used RAG2" 7 " blastocysts in these experiments. This genetic 
lesion blocks the development of B and T cells, so that any immune cells present in 
the chimeric progeny must be derived from the injected (wild type) ES cells (32). 

20 Using this approach we generated mice derived from ES cells infected with pLL3.7 
CD8, pLL3.7 Mena+, and pLL3.7. The degree of chimerism in these animals was 
between 50 and 90% as gauged by GFP fluorescence analysis of whole mice, as well 
as dissected organs (Figure 2 IB and data not shown). This result demonstrated that 
cells expressing siRNAs were not selected against during development and that these 

25 cells were able to contribute to all tissues in the body. 

[00335] In our chimeric mice, almost all cells in the lymphoid organs expressed 
GFP, indicating that they were derived from the injected ES cells (Figure 21C and 
data not shown). To examine whether lentivirus-mediated expression of shRNAs 
resulted in the silencing of CD8 in vivo, we harvested the thymus and spleen of 7 day- 

30 old chimeric mice and stained the cells present in these organs with antibodies against 
CD8 and CD4 according to standard techniques. We found that developing T cells in 
the thymus of pLL3.7 CD8 mice showed a 7-fold reduction in CD8 expression 



Page 98 of 171 



WO 2004/022722 



PCT/US2003/028111 



(Figure 2 ID). Furthermore, no mature CD8+ T cells were detected in this organ or in 
the spleen (Figure 2 ID). In contrast, thymocytes from these mice showed normal 
expression of CD4 and normal numbers of mature CD4+ T cells were found in their 
lymphoid organs. (Figure 21D). T cell differentiation and numbers were normal in 
5 mice derived from pLL3.7 Mena+ and pLL3.7 infected ES cells (Figure 21D and data 
not shown). 

[00336] Example 8: Cre-mediated Extinguishing of a Transgene Expressed from a 
Lentiviral Vector 

10 [00337] This example demonstrates that introduction of Cre recombinase into cells 
expressing a transgene from a lentiviral vector of the invention extinguishes 
expression of the transgene. 
[00338] Materials and Methods 

[00339] Expression ofEGFP using a lentiviral vector. A 50% confluent 10 cm 
15 plate of D7 cells (See Bear JE, et aL, Cell 2000 Jun 23;101(7):717-2 for description of 
cells and culture conditions.), was infected with lOOul of concentrated pLL3.7 B- 
catenin lentivirus, which expressed GFP as a transgene between two LoxP sites. 
Infected cells (pLL3.7 B-catenin containing cells) were sorted based upon expression 
ofEGFP. 

20 [00340] Introduction of Cre. A 50% confluent 6 cm plate of sorted D7 pLL3.7 B~ 
catenin containing cells was infected with adenovirus expressing the Cre recombinase 
(Jackson EL, et aL, Genes Dev 2001 Dec 15;15(24):3243-8. lxlO 5 infectious units 
were used in the infection. Cells were expanded for 10 days to allow time for 
expression of Cre protein, deletion of the loxP-CMVegfp-loxP cassette, and depletion 

25 of EGFP protein pools. Cells were then analyzed by flow cytometry for expression of 
EGFP. Cells were also sorted based upon loss ofEGFP expression and expanded. 
[00341] Results 

[00342] Figure 22A shows flow cytometric analysis ofEGFP expression in cells 
infected with an EGFP-expressing lentiviral vector in which the promoter and EGFP 
30 coding sequences are floxed. Flow cytometry was performed at least 48 hours after 
infection. The solid purple peaks in Figure 22 A and 22B represent uninfected cells. 
As shown in Figure 22A, ninety percent of the infected cells express EGFP. The 
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population of cells expressing EGFP is shown with a green line. Figure 22B shows 
flow cytometric analysis of EGFP expression in cells infected with an EGFP- 
expressing lentiviral vector 10 days after induction of Cre expression. The percentage 
of EGFP-expressing cells is reduced to 49%. Figure 22C shows a direct comparison 
5 between pLL3.7 infected D7 cells before (green line) and after (pink line) Cre 
delivery. Induction of Cre extinguished expression of the floxed transgene in 
approximately half the cells. (The adenoviral titer was not high enough to infect all 
cells, thus cells in which the transgene was not extinguished were probably not 
infected with adenovirus.) 

10 
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[00343] Sequences of pBFGWand pLL3. 0 - pLL3. 7. This section presents the 
sequences of plasmids pBFGW and pLL3.0 — pLL3.7 in the form of GenBank files. 
[00344] pBFGW 

LOCUS PBFGW. GB 10441 BP DS-DNA CIRCULAR SYN 23 - JAN- 

2002 

DEFINITION - 
ACCESSION 
KEYWORDS 
SOURCE 
FEATURES 

promoter 

gene 



rep_origm 

polyA__signal 

gene 

polyA_signal 
rep_origin 
promoter 
LTR 

mi s c_f eature 

mis cofeature 

promoter 

promoter 

intron 

gene 

mis c__f eature 
misc feature 



2414 A 



Location/Qualifiers 
212 . . 816 
/note="CMV 1" 
9448 . . 10308 
/note="AmpR" 
8630. .9303 
/note="pUC" 
6452 . . 6666 
/note="BGH pA" 
7613 . . 7987 
/note="ZeoR" 
8117. .8246 
/note="SV4 0 pA" 
6729. . 7142 
/note=" fl origin" 
7205. .7530 
/note= n SV4 0 ori" 
835 . . 1509 
/note="5' LRT" 
1533 . .2390 

/note="psi sequence" 
2416. .2593 
/note="FLAP" 
2612 . .2974 
/note="CMV 2" 
2798 . .4229 

/note="Beta actin promoter" 
4234. .4327 

/note="beta globin intron" 

4373 . .5089 

/note="EGFP" 

5132 . .5721 

/note="WRE" 

5737. .6426 

/note="3' LTR" 

2697 C 2911 G 2419 T 



0 OTHER 



BASE COUNT 
ORIGIN 

1 GTCGACGGAT CGGGAGATCT CCCGATCCCC TATGGTGCAC TCTCAGTACA 
ATCTGCTCTG 

61 ATGCCGCATA GTTAAGC C AG TATCTGCTCC CTGCTTGTGT GTTGGAGGTC 
GCTGAGTAGT 

121 GCGCGAGCAA AATTTAAGCT ACAACAAGGC AAGGCTTGAC CGACAATTGC 
ATGAAGAATC 

181 TGCTTAGGGT TAGGCGTTTT GCGCTGCTTC GCGATGTACG GGC CAGATAT 
ACGCGTTGAC 

241 ATTGATTATT GACTAGTTAT TAATAGTAAT CAATTACGGG GTCATTAGTT 
CATAGC CC AT 
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3 01 ATATGGAGTT CCGCGTTACA 
CCGCCCAACG 

3 61 ACCCCCGCCC ATTGAGGTCA 
ATAGGGACTT 
5 421 TCCATTGACG TCAATGGGTG 

GTACATCAAG 

481 TGTATCATAT GCCAAGTACG 
CCCGCCTGGC 

541 ATTATGC C C A GTACATGACC 
10 T AC GTATTAG 

601 TCATCGCTAT TACCATGGTG 
GGATAGCGGT 

661 TTGACTCACG GGGATTTCCA 
TTGTTTTGGC 
15 721 ACCAAAATCA ACGGGAGTTT 

ACGCAAATGG 

781 GCGGTAGGCG TGTACGGTGG 
TACTGGGTCT 

841 CTCTGGTTAG ACCAGATCTG 
20 CCCACTGCTT 

901 AAGCCTCAAT AAAGCTTGCC 
GTTGTGTGAC 

961 TCTGGTAACT AGAGATCCCT 
TAGCAGTGGC 
25 1021 GCC CGAAC AG GGACTTGAAA 

CGCAGGACTC 

10 81 GGCTTGCTGA AGCGCGCACG 
ACGCCAAAAA 

1141 TTTTGACTAG CGGAGGCTAG 
30 ATTAAGCGGG 

12 01 GGAGAATTAG ATCGCGATGG 
AAAAAATATA 

12 61 AATTAAAACA TATAGTATGG 
AATCCTGGCC 

35 1321 TGTTAGAAAC ATCAGAAGG C 

TCCCTTCAGA 

13 81 CAGGATCAGA AGAACTTAGA 
TGTGTGCATC 

1441 AAAGGATAGA GATAAAAGAC 
40 GAGCAAAACA 

15 01 AAAGTAAGAC CACCGCACAG 
AGGAGATATG 

15 61 AGGGACAATT GGAGAAGTGA 
AC CATTAGGA 
45 1621 GTAGCACCCA C C AAGGC AAA 

AGTGGGAATA 

1681 GGAGCTTTGT TCCTTGGGTT 
AGCGTCAATG 

1741 ACGCTGACGG TACAGGCCAG 
50 GAACAATTTG 

18 01 CTGAGGGCTA TTGAGGCGCA 
CATCAAGCAG 

18 61 CTCCAGGCAA GAATCCTGGC 
CCTGGGGATT 
55 1921 TGGGGTTGCT CTGGAAAACT 

TAGTTGGAGT 

1981 AATAAATCTC TGGAACAGAT 
CAGAGAAATT 



TAACTTACGG TAAATGGCCC GCCTGGCTGA 
ATAATGACGT ATGTTCCCAT AGTAACGCCA 
GAGTATTTAC GGTAAACTGC CCACTTGGCA 
CCCCCTATTG ACGTCAATGA CGGTAAATGG 
TTATGGGACT TTCCTACTTG GCAGTACATC 
ATGCGGTTTT GGCAGTACAT CAATGGGCGT 
AGTCTCCACC CCATTGACGT CAATGGGAGT 
CCAAAATGTC GTAACAACTC CGCCCCATTG 
GAGGTCTATA TAAGCAGCGC GTTTTGCCTG 
AGCCTGGGAG CTCTCTGGCT AACTAGGGAA 
TTGAGTGCTT CAAGTAGTGT GTGCCCGTCT 
CAGACCCTTT TAGTCAGTGT GGAAAATCTC 
GCGAAAGGGA AACCAGAGGA GCTCTCTCGA 
GCAAGAGGCG AGGGGCGGCG ACTGGTGAGT 
AAGGAGAGAG ATGGGTGCGA GAGCGTCAGT 
GAAAAAATTC GGTTAAGGCC AGGGGGAAAG 
GCAAGCAGGG AGCTAGAACG ATTCGCAGTT 
TGTAGACAAA TACTGGGACA GCTACAACCA 
TCATTATATA ATACAGTAGC AACCCTCTAT 
AC C AAGGAAG CTTTAGACAA GATAGAGGAA 
CAAGCGGCCG CTGATCTTCA GAC CTGGAGG 
ATTATATAAA TATAAAGTAG TAAAAATTGA 
GAGAAGAGTG GTGCAGAGAG AAAAAAGAGC 
CTTGGGAGCA GCAGGAAGCA CTATGGGCGC 
ACAATTATTG TCTGGTATAG TGCAGCAGCA 
ACAGCATCTG TTGCAACTCA CAGTCTGGGG 
TGTGGAAAGA TACCTAAAGG ATCAACAGCT 
CATTTGCACC ACTGCTGTGC CTTGGAATGC 
TTGGAATCAC ACGACCTGGA TGGAGTGGGA 



Page 102 of 171 



WO 2004/022722 



PCT/US2003/028111 



2 041 AACAATTACA CAAGCTTAAT 
GCAAGAAAAG 

2101 AATGAACAAG AATTATTGGA 
GTTTAACATA 
5 2161 ACAAATTGGC TGTGGTATAT 

GGTAGGTTTA 

22 21 AGAATAGTTT TTGCTGTACT 
TTCAC CATTA 

22 81 TCGTTTCAGA CCCACCTCCC 
10 AATAGAAGAA 

2341 GAAGGTGGAG AGAGAGACAG 
GGCACTGCGT 

24 01 GCGCCAATTC TGCAGACAAA 
AAGGGGGGAT 
15 24 61 TGGGGGGTAC AGTGCAGGGG 

TACAAACTAA 

2521 AGAATTACAA AAACAAATTA 
GGGACAGCAG 

2581 AGATCCAGTT TGGTTAATTA 
20 AATTACGGGG 

2 641 TCATTAGTTC ATAGCCCATA 
AAATGGCCCG 

27 01 CCTGGCTGAC CGCCCAACGA 
TGTTCCCATA 
25 2761 GTAACGCCAA TAGGGACTTT 

GTAAACTGCC 

2821 CACTTGGCAG TACATCAAGT 
CGTCAATGAC 

2 8 81 GGTAAATGGC CCGCCTGGCA 
30 TCCTACTTGG 

2 941 CAGTACATCT ACGTATTAGT 
CACGTTCTGC 

3 0 01 TTCACTCTCC CCATCTCCCC 
TATTT TTTAA 

35 3 0 61 TTATTTTGTG CAGCGATGGG 

CGGGGCGGGG 

3121 CGGGGCGAGG GGCGGGGCGG 
TCAGAGCGGC 

3181 GCGCTCCGAA AGTTTCCTTT 
40 TAAAAAGCGA 

3241 AGCGCGCGGC GGGCGGGGAG 
CGCTCCGCCG 

33 01 CCGCCTCGCG CCGCCCGCCC 
GTGAGCGGGC 

45 '33 61 GGGACGGCCC TTCTCCTCCG 

CTTGTTTCTT 

3421 TTCTGTGGCT GCGTGAAAGC 
GGGGGGAGCG 

34 81 GCTCGGGGGG TGCGTGCGTG 
50 TCCGCGCTGC 

3 541 CCGGCGGCTG TGAGCGCTGC 
AGTGTGCGCG 

3 601 AGGGGAGCGC GGCCGGGGGC 
GGGAACAAAG 
55 3 661 GCTGCGTGCG GGGTGTGTGC 

TCGGTCGGGC 

3721 TGCAACCCCC CCTGCACCCC 
TCGGGTGCGG 



ACACTCCTTA ATTGAAGAAT CGCAAAACCA 
ATTAGATAAA TGGGCAAGTT TGTGGAATTG 
AAAATTATTC ATAATGATAG TAGGAGGCTT 
TTCTATAGTG AATAGAGTTA GGCAGGGATA 
AACCCCGAGG GGACCCGACA GGCCCGAAGG 
AGACAGATCC ATTCGATTAG TGAACGGATC 
TGGCAGTATT CAT C C AC AAT TTTAAAAGAA 
AAAGAATAGT AGACATAATA GCAACAGACA 
CAAAAATTCA AAATTTTCGG GTTTATTACA 
ACTGCAGGAA TCTAGTTATT AATAGTAATC 
TATGGAGTTC CGCGTTACAT AACTTACGGT 
CCCCCGCCCA TTGACGTCAA TAATGACGTA 
CCATTGACGT CAATGGGTGG AGTATTTACG 
GTATCATATG CCAAGTACGC CCCCTATTGA 
TTATGC CCAG TACATGACCT TATGGGACTT 
CATCGCTATT ACCATGGTCG AGGTGAGCCC 
CCCCTCCCCA CCCCCAATTT TGTATTTATT 
GGCGGGGGGG GGGGGGGGGC GCGCGCCAGG 
GGCGAGGCGG AGAGGTGCGG CGGCAGCCAA 
TATGGCGAGG CGGCGGCGGC GGCGGCCCTA 
TCGCTGCGAC GCTGCCTTCG CCCCGTGCCC 
CGGCTCTGAC TGACCGCGTT ACTCCCACAG 
GGCTGTAATT AGCGCTTGGT TTAATGACGG 
CTTGAGGGGC TCCGGGAGGG CCCTTTGTGC 
TGTGTGTGCG TGGGGAGCGC CGCGTGCGGC 
GGGCGCGGCG CGGGGCTTTG TGCGCTCCGC 
GGTGCCCCGC GGTGCGGGGG GGGCTGCGAG 
GTGGGGGGGT GAGCAGGGGG TGTGGGCGCG 
CCTCCCCGAG TTGCTGAGCA CGGCCCGGCT 



Page 103 of 171 



WO 2004/022722 



PCT/US2003/028111 



3 781 GGCTCCGTAC GGGGCGTGGC 
CGGCAGGTGG 

3 841 GGGTGCCGGG CGGGGCGGGG 
GGGCGCGGGG 
5 3 901 GCCCCCGGAG CGCCGGCGGC 

CTTTTATGGT 

3 961 AATCGTGCGA GAGGGCGCAG 
CCGAAATCTG 

4 021 GGAGGCGCCG CCGCACCCCC 
10 GCCGGCAGGA 

40 81 AGGAAATGGG CGGGGAGGGC 
CTCCCTCTCC 

4141 AGCCTCGGGG CTGTCCGGGG 
AGGGCGGGGT 
15 42 01 TCGGCTTCTG GCGTGTGACC 

CATGCCTTCT 

42 61 TCTTTTTCCT ACAGCTCCTG 
CATTTTGGCA 

4321 AAGAATTGAT TTGATACCGC 
20 CCATGGTGAG 

43 81 CAAGGGCGAG GAGCTGTTCA 
ACGGCGACGT 

4441 AAACGG C C AC AAGTTCAGCG 
ACGGCAAGCT 
25 45 01 GACCCTGAAG TTCATCTGCA 

CCCTCGTGAC 

4 5 61 CACCCTGACC TACGGCGTGC 
AGCAGCACGA 

4 621 CTTCTTCAAG TCCGCCATGC 
30 TCTTCAAGGA 

4681 CGACGGCAAC TACAAGACCC 
TGGTGAACCG 

4 741 CATCGAGCTG AAGGGCATCG 
ACAAGCTGGA 
35 4 8 01 GTACAACTAC AACAGCCACA 

ACGGCATCAA 

4 8 61 GGTGAACTTC AAGATCCGCC 
CCGACCACTA 

4 921 CCAGCAGAAC ACCCCCATCG 
40 ACTACCTGAG 

4 981 CACCCAGTCC GCCCTGAGCA 
TCCTGCTGGA 

5041 GTTCGTGACC GCCGCCGGGA 
AAAGCGGCCG 
45 5101 CGACTCTAGA ATTCGATATC 

AAAATTTGTG 

5161 AAAGATTGAC TGGTATTCTT 
TACGCTGCTT 

5221 TAATGCCTTT GTATCATGCT 
50 TCCTTGTATA 

52 81 AATCCTGGTT GCTGTCTCTT 
CGTGGCGTGG 

5341 TGTGCACTGT GTTTGCTGAC 
ACCTGTCAGC 
55 54 01 TCCTTTCCGG GACTTTCGCT 

ATCGCCGCCT 

54 61 GCCTTGCCCG CTGCTGGACA 
GTGGTGTTGT 



GCGGGGCTCG CCGTGCCGGG CGGGGGGTGG 
CCGCCTCGGG CCGGGGAGGG CTCGGGGGAG 
TGTCGAGGCG CGGCGAGCCG CAGCCATTGC 
GGACTTCCTT TGTCCCAAAT CTGTGCGGAG 
TCTAGCGGGC GCGGGGCGAA GCGGTGCGGC 
CTTCGTGCGT CGCCGCGCCG CCGTCCCCTT 
GGGGACGGCT GCCTTCGGGG GGGACGGGGC 
GGCGGCTCTA GAGCCTCTGC TAACCATGTT 
GGCAACGTGC TGGTTATTGT GCTGTCTCAT 
GGGCCCGGGA TCCCCGGGTA CCGGTCGCCA 
CCGGGGTGGT GCCCATCCTG GTCGAGCTGG 
TGTCCGGCGA GGGCGAGGGC GATGCCACCT 
CCACCGGCAA GCTGCCCGTG CCCTGGCCCA 
AGTGCTTCAG CCGCTACCCC GACCACATGA 
CCGAAGGCTA CGTCCAGGAG CGCAGCATCT 
GCGCCGAGGT GAAGTTCGAG GGCGACACCC 
ACTTCAAGGA GGACGGCAAC ATCCTGGGGC 
ACGTCTATAT CATGGCCGAC AAGCAGAAGA 
AC AAC AT CG A GGACGGCAGC GTGCAGCTCG 
GCGACGGCCC CGTGCTGCTG CCCGACAACC 
AAGACCCCAA CGAGAAGCGC GATCACATGG 
TCACTCTCGG CATGGACGAG CTGTACAAGT 
AAGCTTATCG ATAATCAACC TCTGGATTAC 
AACTATGTTG CTCCTTTTAC GCTATGTGGA 
ATTGCTTCCC GTATGGCTTT CATTTTCTCC 
TATGAGGAGT TGTGGCCCGT TGTCAGGCAA 
GCAACCCCCA CTGGTTGGGG CATTGCCACC 
TTCCCCCTCC CTATTGCCAC GGCGGAACTC 
GGGGCTCGGC TGTTGGGCAC TGACAATTCC 
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5521 CGGGGAAATC ATCGTCCTTT 
ATTCTGCGCG 

55 81 GGACGTCCTT CTGCTACGTC 
TCCCGCGGCC 
5 5 641 TGCTGCCGGC TCTGCGGCCT 

AGTCGGATCT 

5 701 CCCTTTGGGC CGCCTCCCCG 
AAACATGGAG 

57 61 CAATCACAAG TAGCAATACA 
10 GAAGCACAAG 

5 821 AGGAGGAGGA GGTGGGTTTT 
ATGACTTACA 

5 881 AGGCAGCTGT AGATCTTAGC 
GGGCTAATTC 
15 5 941 ACTCCCAACG AAGACAAGAT 

GGCTACTTCC 

60 01 CTGATTGGCA GAACTACACA 
TTTGGATGGT 

6061 GCTACAAGCT AGTACCAGTT 
20 GGAGAGAACA 

6121 CCCGCTTGTT ACACCCTGTG 
GAAGTATTAG 

6181 AGTGGAGGTT TGACAGC CGC 
CATCCGGACT 
25 6241 GTACTGGGTC TCTCTGGTTA 

TAACTAGGGA 

63 01 ACCCACTGCT TAAGCCTCAA 
TGTGCCCGTC 

63 61 TGTTGTGTGA CTCTGGTAAC 
30 TGGAAAATCT 

6421 CTAGCAGGGC CCGTTTAAAC 
TTGCCAGCCA 

64 81 TCTGTTGTTT GCCCCTCCCC 
TCCCACTGTC 

35 6541 CTTTCCTAAT AAAATGAGGA 

TTCTATTCTG 

6601 GGGGGTGGGG TGGGGCAGGA 
CAGGCATGCT 

6661 GGGGATGCGG TGGGCTCTAT 
40 CTCTAGGGGG 

6721 TATCCCCACG CGCCCTGTAG 
TACGCGCAGC 

67 81 GTGACCGCTA CACTTGCCAG 
CCCTTCCTTT 
45 6841 CTCGCCACGT TCGCCGGCTT 

TTTAGGGTTC 

69 01 CGATTTAGTG CTTTACGGCA 
TGGTTCACGT 

6961 AGTGGGCCAT CGCCCTGATA 
50 CACGTTCTTT 

7 021 AATAGTGGAC TCTTGTTCCA 
CTATTCTTTT 

70 81 GATTTATAAG GGATTTTGCC 
GATTTAACAA 

55 7141 AAAT TTAACG CGAATTAATT 

AAGTCCCCAG 

72 01 GCTCCCCAGC AGGCAGAAGT 
ACCAGGTGTG 



CCTTGGCTGC TCGCCTGTGT TGCCACCTGG 
CCTTCGGCCC TCAATCCAGC GGACCTTCCT 
CTTCCGCGTC TTCGCCTTCG CCCTCAGACG 
CAT CGAT ACC GTCGACCTCG AGAC CTAGAA 
GCAGCTACCA ATGCTGATTG TGCCTGGCTA 
CCAGTCACAC CTCAGGTACC TTTAAGACCA 
CACTTTTTAA AAGAAAAGGG GGGACTGGAA 
ATCCTTGATC TGTGGATCTA CCACACACAA 
CCAGGGCCAG GGATCAGATA TCCACTGACC 
GAGCAAGAGA AGGTAGAAGA AGCCAATGAA 
AGCCTGCATG GGATGGATGA CCCGGAGAGA 
CTAGCATTTC ATCACATGGC CCGAGAGCTG 
GACCAGATCT GAGCCTGGGA GCTCTCTGGC 
TAAAGCTTGC CTTGAGTGCT TCAAGTAGTG 
TAGAGATCCC TCAGACCCTT TTAGTCAGTG 
CCGCTGATCA GCCTCGACTG TGCCTTCTAG 
CGTGCCTTCC TTGACCCTGG AAGGTGCCAC 
AATTGCATCG CATTGTCTGA GTAGGTGTCA 
CAGCAAGGGG GAGGATTGGG AAGACAATAG 
GGCTTCTGAG GCGGAAAGAA CCAGCTGGGG 
CGGCGCATTA AGCGCGGCGG GTGTGGTGGT 
CGCCCTAGCG CCCGCTCCTT TCGCTTTCTT 
TCCCCGTCAA GCTCTAAATC GGGGGCTCCC 
CCTCGACCCC AAAAAACTTG ATTAGGGTGA 
GACGGTTTTT CGCCCTTTGA CGTTGGAGTC 
AACTGGAACA ACACTCAACC CTATCTCGGT 
GATTTCGGCC TATTGGTTAA AAAATGAGCT 
CTGTGGAATG TGTGTCAGTT AGGGTGTGGA 
ATGCAAAGCA TGCATCTCAA TTAGTCAGCA 
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72 61 GAAAGTCCCC AGGCTCCCCA 
AATTAGTCAG 

7321 CAACCATAGT CCCGCCCCTA 
AGTTCCGCCC 
5 73 81 ATTCTCCGCC CCATGGCTGA 

GCCGCCTCTG 

7441 CCTCTGAGCT ATTC CAGAAG 
TTTTGCAAAA 

75 01 AGCTCCCGGG AGCTTGTATA 
10 ACAATTAATC 

7561 ATCGGCATAG TATATCGGCA 
CCATGGCCAA 

7 621 GTTGAC CAGT GCCGTTCCGG 
TCGAGTTCTG 
15 7 681 GACCGACCGG CTCGGGTTCT 

GTGTGGTCCG 

7741 GGACGACGTG ACCCTGTTCA 
ACAACACCCT 

7 801 GGCCTGGGTG TGGGTGCGCG 
20 AGGTCGTGTC 

7 861 CACGAACTTC CGGGACGCCT 
AGCCGTGGGG 

7 921 GCGGGAGTTC GCCCTGCGCG 
CCGAGGAGCA 

25 7 981 GGACTGACAC GTGCTACGAG 

GGTTGGGCTT 

8 041 CGGAATCGTT TTCCGGGACG 
TCATGCTGGA 

8101 GTTCTTCGCC CACCCCAACT 
30 AAAGCAATAG 

8161 CATCACAAAT TTCACAAATA 
GTTTGTCCAA 

8221 ACTCATCAAT GTATCTTATC 
GCTTGGCGTA 
35 82 81 ATCATGGTCA TAGCTGTTTC 

CACACAACAT 

8341 ACGAGCCGGA AGCATAAAGT 
AACTCACATT 

84 01 AATTGCGTTG CGCTCACTGC 
40 AGCTGCATTA 

84 61 ATGAATCGGC CAACGCGCGG 
CCGCTTCCTC 

8521 GCTCACTGAC TCGCTGCGCT 
CTCACTCAAA 
45 8581 GGCGGTAATA CGGTTATCCA 

TGTGAGCAAA 

8 641 AGGCCAGCAA AAGGCCAGGA 
TCCATAGGCT 

8 7 01 CCGCCCCCCT GACGAGCATC 
50 GAAAC C CGAC 

8761 AGGACTATAA AGATACCAGG 
CTCCTGTTCC 

8 821 GACCCTGCCG CTTACCGGAT 
TGGCGCTTTC 
55 8 8 81 TCATAGCTCA CGCTGTAGGT 

AGCTGGGCTG 

8 941 TGTGCACGAA CCCCCCGTTC 
ATCGTCTTGA 



GCAGGCAGAA GTATGCAAAG CATGCATCTC 
ACTCCGCCCA TCCCGCCCCT AACTCCGCCC 
CTAATTTTTT TTATTTATGC AGAGGC CGAG 
TAGTGAGGAG GCTTTTTTGG AGGCCTAGGC 
TCCATTTTCG GATCTGATCA GCACGTGTTG 
TAGTATAATA CGACAAGGTG AGGAACTAAA 
TGCTCACCGC GCGCGACGTC GCCGGAGCGG 
CCCGGGACTT CGTGGAGGAC GACTTCGCCG 
TCAGCGCGGT CCAGGACCAG GTGGTGCCGG 
GCCTGGACGA GCTGTACGCC GAGTGGTCGG 
CCGGGCCGGC CATGACCGAG ATCGGCGAGC 
ACCCGGCCGG CAACTGCGTG CACTTCGTGG 
ATTTCGATTC CACCGCCGCC TTCTATGAAA 
CCGGCTGGAT GATCCTCCAG CGCGGGGATC 
TGTTTATTGC AGCTTATAAT GGTTACAAAT 
AAGCATTTTT TTCACTGCAT TCTAGTTGTG 
ATGTCTGTAT ACCGTCGACC TCTAGCTAGA 
CTGTGTGAAA TTGTTATCCG CTCACAATTC 
GTAAAGCCTG GGGTGCCTAA TGAGTGAGCT 
CCGCTTTCCA GTCGGGAAAC CTGTCGTGCC 
GGAGAGGCGG TTTGCGTATT GGGCGCTCTT 
CGGTCGTTCG GCTGCGGCGA GCGGTATCAG 
CAGAATCAGG GGATAACGCA GGAAAGAACA 
ACCGTAAAAA GGCCGCGTTG CTGGCGTTTT 
ACAAAAATCG ACGCTCAAGT CAGAGGTGGC 
CGTTTCCCCC TGGAAGCTCC CTCGTGCGCT 
ACCTGTCCGC CTTTCTCCCT TCGGGAAGCG 
ATCTCAGTTC GGTGTAGGTC GTTCGCTCCA 
AGCCCGACCG CTGCGCCTTA TCCGGTAACT 
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90 01 GTCCAACCCG GTAAGACACG 
ACAGGATTAG 

9061 CAGAGCGAGG TATGTAGGCG 
ACTACGGCTA 
5 9121 CACTAGAAGA ACAGTATTTG 

TCGGAAAAAG 

9181 AGTTGGTAGC TCTTGATCCG 
TTTTTGTTTG 

9241 CAAGCAGCAG ATTACGCGCA 
10 TCTTTTCTAC 

93 01 GGGGTCTGAC GCTCAGTGGA 
TGAGATTATC 

93 61 AAAAAGGATC TTCACCTAGA 
CAATCTAAAG 

15 9421 TATATATGAG TAAACTTGGT 

CACCTATCTC 

94 81 AGCGATCTGT CTATTTCGTT 
AGATAACTAC 

9541 GATACGGGAG GGCTTACCAT 
20 ACCCACGCTC 

9601 ACCGGCTCCA GATTTATCAG 
GCAGAAGTGG 

9661 TCCTGCAACT TTATCCGCCT 
CTAGAGTAAG 
25 9721 TAGTTCGCCA GTTAATAGTT 

TCGTGGTGTC 

97 81 ACGCTCGTCG TTTGGTATGG 
GGCGAGTTAC 

9841 ATGATCCCCC ATGTTGTGCA 
30 TCGTTGTCAG 

99 01 AAGTAAGTTG GCCGCAGTGT 
ATTCTCTTAC 

9961 TGTCATGCCA TCCGTAAGAT 
AGTCATTCTG 
35 10 021 AGAATAGTGT ATGCGGCGAC 

ATAATACCGC 

100 81 GCCACATAGC AGAACTTTAA 
GGCGAAA&CT 

10141 CTCAAGGATC TTACCGCTGT 
40 CACCCAACTG 

102 01 ATCTTCAGCA TCTTTTACTT 
GAAGGCAAAA 

10261 TGCCGCAAAA AAGGGAATAA 
TCTTCCTTTT 
45 10321 TCAATATTAT TGAAGCATTT 

TATTTGAATG 

10381 TATTTAGAAA AATAAACAAA 
TGCCACCTGA 
10441 C 

50 // 

(SEQ ID NO: 1) 



PCT/US2003/028111 

ACTTATCGCC ACTGGCAGCA GCCACTGGTA 
GTGCTACAGA GTTCTTGAAG TGGTGGCCTA 
GTATCTGCGC TCTGCTGAAG CCAGTTACCT 
GCAAACAAAC CACCGCTGGT AGCGGTGGTT 
GAAAAAA&GG ATCTCAAGAA GATCCTTTGA 
ACGAAAACTC ACGTTAAGGG ATTTTGGTCA 
TCCTTTTAAA TTAAAAATGA AGTTTTAAAT 
CTGACAGTTA CCAATGCTTA AT C AGTGAGG 
CATC CAT AGT TGCCTGACTC CCCGTCGTGT 
CTGGCCCCAG TGCTGCAATG AT AC CGCGAG 
CAATAAACCA GCCAGCCGGA AGGGCCGAGC 
CCATCCAGTC TATTAATTGT TGCCGGGAAG 
TGCGCAACGT TGTTGCCATT GCTACAGGCA 
CTTCATTCAG CTCCGGTTCC CAACGATCAA 
AAAAAGCGGT TAGCTCCTTC GGTCCTCCGA 
TATCACTCAT GGTTATGGCA GCACTGCATA 
GCTTTTCTGT GACTGGTGAG TACTCAACCA 
CGAGTTGCTC TTGCCCGGCG TCAATACGGG 
AAGTGCTCAT CATTGGAAAA CGTTCTTCGG 
TGAGATCCAG TTCGATGTAA CCCACTCGTG 
TCACCAGCGT TTCTGGGTGA GCAAAAACAG 
GGGCGACACG GAAATGTTGA ATACTCATAC 
ATCAGGGTTA TTGTCTCATG AG CGGATACA 
TAGGGGTTCC GCGCACATTT CCCCGAAAAG 
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[00345] pLL3.0 

LOCUS PLENTILOX 6027 BP DS-DNA CIRCULAR SYN 23- JAN - 

5 2002 

DEFINITION - 
ACCESSION 
KEYWORDS 
SOURCE 

10 FEATURES Location/Qualifiers 
promoter 212 . .816 

/note="CMV promoter/ enhancer 1" 
gene 5034.. 5894 

/note= "AmpR" 
15 rep_origin 4216.. 4889 

/note="pUC" 
misc_recomb 2710. .2743 

/note="LoxP" 
misc_recomb 2827. .2860 

20 /note="LoxP" 
LTR 835 . . 1509 

/note="5' HIV R-U5-del gag (HIV NL4-3/454 -1126) " 
misc__f eature 1539. .2396 

/note="HIV RRE (HIV NL4 -3/7622 - 8459) " 
25 misc_feature 2422.. 2599 

/note="HIV Flap" 
misc_feature 2915. .3504 

/note="WRE element" 
LTR 3524 . .4213 

30 /note="3' SIN LTR" 

BASE COUNT 1612 A 1408 C 1518 G 1489 T 0 OTHER 

ORIGIN 

1 GTCGACGGAT CGGGAGATCT CCCGATCCCC TATGGTGCAC TCTCAGTACA 
ATCTGCTCTG 

35 61 ATGCCGCATA GTTAAGCCAG TATCTGCTCC CTGCTTGTGT GTTGGAGGTC 

GCTGAGTAGT 

121 GCGCGAGCAA AATTTAAGCT ACAACAAGGC AAGGCTTGAC CGACAATTGC 
ATGAAGAATC 

181 TGCTTAGGGT TAGGCGTTTT GCGCTGCTTC GCGATGTACG GGC CAGAT AT 
40 ACGCGTTGAC 

241 ATTGATTATT GACTAGTTAT TAATAGTAAT CAATTACGGG GTCATTAGTT 
C ATAGC C CAT 

3 01 ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA 
CCGCCCAACG 

45 3 61 ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTC C CAT AGTAACGCCA 

ATAGGGACTT 

421 TCCATTGACG TCAATGGGTG GAGTATTTAC GGTAAACTGC CCACTTGGCA 
GTACATCAAG 

4 81 TGTATCATAT GCCAAGTACG CCCCCTATTG ACGTCAATGA CGGTAAATGG 
50 CCCGCCTGGC 

541 ATTATGCCCA GTACATGACC TTATGGGACT TTCCTACTTG GCAGTACATC 
TACGT AT TAG 

601 TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT CAATGGGCGT 
GGATAGCGGT 

55 661 TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT CAATGGGAGT 

TTGTTTTGGC 

721 ACCAAAATCA ACGGGACTTT CCAAAATGTC GTAACAACTC CGCCCCATTG 
ACGCAAATGG 
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781 GCGGTAGGCG TGTACGGTGG 
TACTGGGTCT 

841 CTCTGGTTAG ACCAGATCTG 
CCCACTGCTT 

901 AAGCCTCAAT AAAGCTTGCC 
GTTGTGTGAC 

961 TCTGGTAACT AGAGATCCCT 
TAGCAGTGGC 

1021 GCCCGAACAG GGACTTGAAA 
CGCAGGACTC 

10 81 GGCTTGCTGA AGCGCGCACG 
ACGCCAAAAA 

1141 TTTTGACTAG CGGAGGC TAG 
ATTAAGCGGG 

12 01 GGAGAATTAG ATCGCGATGG 
AAAAAATATA 

12 61 AATTAAAACA TATAGTATGG 
AATCCTGGCC 

13 21 TGTTAGAAAC ATCAGAAGGC 
TCCCTTCAGA 

13 81 CAGGATCAGA AGAACTTAGA 
TGTGTGCATC 

1441 AAAGGATAGA GATAAAAGAC 
GAGCAAAACA 

15 01 AAAGTAAGAC CACCGCACAG 
TG GAGGAGG A 

1561 GATATGAGGG ACAATTGGAG 
AATTGAAC C A 

1621 TTAGGAGTAG CACCCACCAA 
AAGAGCAGTG 

1681 GGAATAGGAG CTTTGTTCCT 
GGGCGCAGCG 

1741 TCAATGACGC TGACGGTACA 
GCAGCAGAAC 

1801 AATTTGCTGA GGGCTATTGA 
CTGGGGCATC 

1861 AAGCAGCTCC AGGCAAGAAT 
ACAGCTCCTG 

1921 GGGATTTGGG GTTGCTCTGG 
GAATGCTAGT 

1981 TGGAGTAATA AATCTCTGGA 
GTGGGACAGA 

2 041 GAAATTAACA ATTACACAAG 
AAACCAGCAA 

2101 GAAAAGAATG AACAAGAATT 
GAATTGGTTT 

2161 AACATAACAA ATTGGCTGTG 
AGGCTTGGTA 

2221 GGTTTAAGAA TAGTTTTTGC 
GGGATATTCA 

22 81 CCATTATCGT TTCAGACCCA 
CGAAGGAATA 

2341 GAAGAAGAAG GTGGAGAGAG 
CGGATCGGCA 

24 01 CTGCGTGCGC CAATTCTGCA 
AAAGAAAAGG 

24 61 GGGGATTGGG GGGTACAGTG 
CAGACATACA 



GAGGTCTATA TAAGCAGCGC GTTTTGCCTG 
AGCCTGGGAG CTCTCTGGCT AACTAGGGAA 
TTGAGTGCTT CAAGTAGTGT GTGCCCGTCT 
CAGACCCTTT TAGTCAGTGT GGAAAATCTC 
GCGAAAGGGA AACCAGAGGA GCTCTCTCGA 
GCAAGAGGCG AGGGGCGGCG ACTGGTGAGT 
AAGGAGAGAG ATGGGTGCGA GAGCGTCAGT 
GAAAAAATTC GGTTAAGGCC AGGGGGAAAG 
GCAAGCAGGG AGCTAGAACG ATTCGCAGTT 
TGTAGACAAA TACTGGGACA GCTACAACCA 
TCATTATATA ATACAGTAGC AACCCTCTAT 
AC C AAGGAAG CTTTAGACAA GATAGAGGAA 
CAAGCGGCCG GCCGCGCTGA TCTTCAGACC 
AAGTGAATTA TATAAATATA AAGTAGTAAA 
GGCAAAGAGA AGAGTGGTGC AGAGAGAAAA 
TGGGTTCTTG GGAGCAGCAG GAAGCACTAT 
GGCCAGACAA TTATTGTCTG GTATAGTGCA 
GGCGCAACAG CATCTGTTGC AACTCACAGT 
CCTGGCTGTG GAAAGAT AC C TAAAGGATCA 
AAAACTCATT TGCACCACTG CTGTGCCTTG 
ACAGATTTGG AATCACACGA CCTGGATGGA 
CTTAATACAC TCCTTAATTG AAGAAT CGC A 
ATTGGAATTA GATAAATGGG CAAGTTTGTG 
GTATATAAAA TTATTCATAA TGATAGTAGG 
TGTACTTTCT ATAGTGAATA GAGTTAGGCA 
CCTCCCAACC CCGAGGGGAC CCGACAGGCC 
AGACAGAGAC AGATCCATTC GATTAGTGAA 
GACAAATGGC AGTATTCATC CACAATTTTA 
CAGGGGAAAG AATAGTAGAC ATAATAGCAA 



Page 109 of 171 



WO 2004/022722 



PCT/US2003/028111 



2521 AACTAAAGAA TTACAAAAAC 
ATTACAGGGA 

2 581 CAGCAGAGAT CCAGTTTGGT 
CGCTAGCCGT 

2 641 TAATTAAGCC TCGAGGTCGA 
CAGCAGGTCG 

27 01 AGGGACCTAA TAACTTCGTA 
GTTCCAAGCT 

2 7 61 TAAGCGGCCG CCGATGCATG 
TTGGTCGAGG 

2 821 GACCTAATAA CTTCGTATAG 
GGGTTCCGGT 

2 881 TCCACTAGGT ACAATTCGAT 
TACAAAATTT 

2 941 GTGAAAGATT GACTGGTATT 
GGATACGCTG 

3 001 CTTTAATGCC TTTGTATCAT 
TCCTCCTTGT 

3 061 ATAAATCCTG GTTGCTGTCT 
CAACGTGGCG 

3121 TGGTGTGCAC TGTGTTTGCT 
ACCACCTGTC 

3181 AGCTCCTTTC CGGGACTTTC 
CTCATCGCCG 

3241 CCTGCCTTGC CCGCTGCTGG 
TCCGTGGTGT 

33 01 TGTCGGGGAA ATCATCGTCC 
TGGATTCTGC 

3 3 61 GCGGGACGTC CTTCTGCTAC 
CCTTCCCGCG 

3421 GCCTGCTGCC GGCTCTGCGG 
ACGAGTCGGA 

34 81 TCTCCCTTTG GGCCGCCTCC 
CCTAGAAAAA 

3 541 CATGGAGCAA TCACAAGTAG 
CTGGCTAGAA 

3 601 GCACAAGAGG AGGAGGAGGT 
AAGACCAATG 

3 661 ACTTACAAGG CAGCTGTAGA 
ACTGGAAGGG 

3 721 CTAATTCACT CCCAACGAAG 
CACACAAGGC 

3 781 TACTTCCCTG ATTGGCAGAA 
ACTGACCTTT 

3 841 GGATGGTGCT ACAAGCTAGT 
CAATGAAGGA 

3 901 GAGAACACCC GCTTGTTACA 
GGAGAGAGAA 

3 961 GTATTAGAGT GGAGGTTTGA 
AGAGCTGCAT 

4 021 CCGGACTGTA CTGGGTCTCT 
CTCTGGCTAA 

4081 CTAGGGAACC CACTGCTTAA 
AGTAGTGTGT 

4141 GCCCGTCTGT TGTGTGACTC 
GTCAGTGTGG 

42 01 AAAATCTCTA GCAGCATGTG 
TAAAAAGGCC 



AAATTACAAA AATTCAAAAT TTTCGGGTTT 
TAGTAC CGGG CCCGCTCTAG ACGGTTAACG 
CGGTATCGAT AAGCTCGCTT CACGAGATTC 
TAGCATACAT TATACGAAGT TATATTAAGG 
CCCCGGGATG GCGCGCCATG GATCCGCGAA 
CATACATTAT ACGAAGTTAT ACATGTTTAA 
ATCAAGCTTA T CGATAAT C A ACCTCTGGAT 
CTTAACTATG TTGCTCCTTT TACGCTATGT 
GCTATTGCTT CCCGTATGGC TTTCATTTTC 
CTTTATGAGG AGTTGTGGCC CGTTGTCAGG 
GACGCAACCC CCACTGGTTG GGGCATTGCC 
GCTTTCCCCC TCCCTATTGC CACGGCGGAA 
ACAGGGGCTC GGCTGTTGGG CACTGACAAT 
TTTCCTTGGC TGCTCGCCTG TGTTGCCACC 
GTCCCTTCGG CCCTCAATCC AGCGGACCTT 
CCTCTTCCGC GTCTTCGCCT TCGCCCTCAG 
CCGCATCGAT ACCGTCGACC TCGATCGAGA 
C AAT AC AG C A GCTACCAATG CTGATTGTGC 
GGGTTTTCCA GTCACACCTC AGGTACCTTT 
TCTTAGCCAC TTTTTAAAAG AAAAGGGGGG 
ACAAGATATC CTTGATCTGT GGATCTACCA 
CTACACACCA GGGCCAGGGA TCAGATATCC 
AC C AGTTGAG CAAGAGAAGG TAGAAGAAGC 
CCCTGTGAGC CTGCATGGGA TGGATGACCC 
CAGCCGCCTA GCATTTCATC ACATGGCCCG 
CTGGTTAGAC CAGATCTGAG CCTGGGAGCT 
GCCTCAATAA AGCTTGCCTT GAGTGCTTCA 
TGGTAACTAG AGATCCCTCA GACCCTTTTA 
AGCAAAAGGC CAGCAAAAGG CCAGGAACCG 
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42 61 GCGTTGCTGG CGTTTTTCCA TAGGCTCCGC CCCCCTGACG AGCATCACAA 
AAATCGACGC 

43 21 TCAAGTCAGA GGTGGCGAAA CCCGACAGGA CTATAAAGAT ACCAGGCGTT 
TCCCCCTGGA 

5 43 81 AGCTCCCTCG TGCGCTCTCC TGTTCCGACC CTGCCGCTTA C CGGATAC CT 

GTCCGCCTTT 

4441 CTCCCTTCGG GAAGCGTGGC GCTTTCTCAT AGCTCACGCT GTAGGTATCT 
CAGTTCGGTG 

45 01 TAGGTCGTTC GCTCCAAGCT GGGCTGTGTG CACGAACCCC CCGTTCAGCC 
10 CGACCGCTGC 

45 61 GCCTTATCCG GTAACTATCG TCTTGAGTCC AACCCGGTAA GACACGACTT 
ATCGCCACTG 

4 621 GCAGCAGCCA CTGGTAACAG GATTAG C AG A GCGAGGTATG TAGGCGGTGC 
TACAGAGTTC 

15 4 681 TTGAAGTGGT GGCCTAACTA CGGCTACACT AGAAGAACAG TATTTGGTAT 

CTGCGCTCTG 

4741 CTGAAGCCAG TTACCTTCGG AAAAAGAGTT GGTAGCTCTT GATCCGGCAA 
ACAAACCACC 

4 801 GCTGGTAGCG GTGGTTTTTT TGTTTGCAAG CAGCAGATTA CGCGCAGAAA 
20 AAAAGGATCT 

4 861 CAAGAAGATC CTTTGATCTT TTCTACGGGG TCTGACGCTC AGTGGAACGA 
AAACTCACGT 

4921 TAAGGGATTT TGGTCATGAG ATTATCAAAA AGGATCTTCA CCTAGATCCT 
TTTAAATTAA 

25 4981 AAATGAAGTT TTAAATCAAT CTAAAGTATA TATGAGTAAA CTTGGTCTGA 

CAGTTACCAA 

5 041 TGCTTAATCA GTGAGGCACC TATCTCAGCG ATCTGTCTAT TTCGTTCATC 
CATAGTTGCC 

5101 TGACTCCCCG TCGTGTAGAT AACTACGATA CGGGAGGGCT TACCATCTGG 
30 CCCCAGTGCT 

5161 GCAATGATAC CGCGAGACCC ACGCTCACCG GCTCCAGATT TATCAGCAAT 
AAACCAGCCA 

5221 GCCGGAAGGG CCGAGCGCAG AAGTGGTCCT GCAACTTTAT CCGCCTCCAT 
CCAGTCTATT 

35 52 81 AATTGTTGCC GGGAAGCTAG AGTAAGTAGT TCGCCAGTTA ATAGTTTGCG 

CAACGTTGTT 

5341 GCCATTGCTA CAGGCATCGT GGTGTCACGC TCGTCGTTTG GTATGGCTTC 
ATTCAGCTCC 

5401 GGTTCCCAAC GATCAAGGCG AGTTACATGA TCCCCCATGT TGTGCAAAAA 
40 AGCGGTTAGC 

5461 TCCTTCGGTC CTCCGATCGT TGTCAGAAGT AAGTTGGCCG CAGTGTTATC 
ACTCATGGTT 

5521 ATGGCAGCAC TGCATAATTC TCTTACTGTC ATGCCATCCG TAAGATGCTT 
TTCTGTGACT 

45 5581 GGTGAGTACT CAACCAAGTC ATTCTGAGAA TAGTGTATGC GGCGACCGAG 

TTGCTCTTGC 

5641 CCGGCGTCAA TACGGGATAA TACCGCGCCA CATAGCAGAA CTTTAAAAGT 
GCTCATCATT 

5 701 GGAAAACGTT CTTCGGGGCG AAAACTCTCA AGGATCTTAC CGCTGTTGAG 
50 ATCCAGTTCG 

5761 ATGTAAC C C A CTCGTGCACC CAACTGATCT TCAGCATCTT TTACTTTCAC 
CAGCGTTTCT 

5821 GGGTGAGCAA AAACAGGAAG GCAAAATGCC GCAAAAAAGG GAATAAGGGC 
GACACGGAAA 

55 5 881 TGTTGAATAC TCATACTCTT CCTTTTTCAA TATTATTGAA GCATTTATCA 

GGGTTATTGT 

5 941 CTCATGAGCG GATACATATT TGAATGTATT TAGAAAAATA AACAAATAGG 
GGTTCCGCGC 

60 01 ACATTTCCCC GAAAAGTGCC ACCTGAC 
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[00346] pLLS.l 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



LOCUS 
2002 

DEFINITION - 
ACCESSION 
KEYWORDS 
SOURCE 
FEATURES 

promoter 



PLENTILOX 



6748 BP DS-DNA 



CIRCULAR SYN 



23- JAN- 



misc_recomb 
gene 

rep_origin 

misc_recomb 

LTR 

mis cofeature 
mi sc_f eature 
mi sc_f eature 
LTR 
gene 



Location/ Qualifiers 
212 . .816 

/note="CMV promoter/ enhancer 1" 
3548. .3581 
/note=»LoxP" 
5755. .6615 
/note="AmpR" 
4937 . . 5610 
/note="pUC" 
2710. .2745 
/note="LoxP" 
835 . . 1509 

/note="5' HIV R-U5-del gag (HIV NL4 -3/454 - 1126) » 
1539. .2396 

/note="HIV RRE (HIV NL4-3/7622-8459) » 
2422 . .2599 
/note="HIV Flap" 
3636. .4225 
/note="WRE element" 
4245 . .4934 
/note= M 3 ' SIN LTR " 
2772 . .3494 
/ note= IT EGFP " 

BASE COUNT 1785 A 1651 C 1721 G 1591 T 0 OTHER 

ORIGIN 

1 GTCGACGGAT CGGGAGATCT CCCGATCCCC TATGGTGCAC TCTCAGTACA 
ATCTGCTCTG 

61 ATGCCGCATA GTTAAGCCAG TATCTGCTCC CTGCTTGTGT GTTGGAGGTC 
GCTGAGTAGT 

121 GCGCGAGCAA AATTTAAGCT ACAACAAGGC AAGGCTTGAC CGACAATTGC 
ATGAAGAATC 

181 TGCTTAGGGT TAGGCGTTTT GCGCTGCTTC GCGATGTACG GGCCAGATAT 
ACGCGTTGAC 

241 ATTGATTATT GACTAGTTAT TAATAGTAAT CAATTACGGG GTCATTAGTT 
CATAGCC CAT 

3 01 ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA 
CCGCCCAACG 

3 61 ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTT C C CAT AGTAACGCCA 
ATAGGGACTT 

421 TCCATTGACG TCAATGGGTG GAGTATTTAC GGTAAACTGC CCACTTGGCA 
GTACATCAAG 

4 81 TGTATCATAT GCCAAGTACG CCCCCTATTG ACGT C AATGA CGGTAAATGG 
CCCGCCTGGC 

541 ATTATGCCCA GTACATGACC TTATGGGACT TTCCTACTTG GCAGTACATC 
TACGTATTAG 

601 TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT CAATGGGCGT 
GGATAGCGGT 

661 TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT CAATGGGAGT 
TTGTTTTGGC 
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721 ACCAAAATCA ACGGGACTTT 
ACGCAAATGG 

781 GCGGTAGGCG TGTACGGTGG 
TACTGGGTCT 
5 841 CTCTGGTTAG ACCAGATCTG 

CCCACTGCTT 

901 AAGCCTCAAT AAAGCTTGCC 
GTTGTGTGAC 

961 TCTGGTAACT AGAGATCCCT 
10 TAGCAGTGGC 

1021 GCCCGAACAG GGACTTGAAA 
CGCAGGACTC 

10 81 GGCTTGCTGA AGCGCGCACG 
ACGCCAAAAA 
15 1141 TTTTGACTAG CGGAGGCTAG 

ATTAAGCGGG 

12 01 GGAGAATTAG ATCGCGATGG 
AAAAAATATA 

12 61 AATTAAAACA TATAGTATGG 
20 AATCCTGGCC 

1321 TGTTAGAAAC ATCAGAAGGC 
TCCCTTCAGA 

13 81 CAGGATCAGA AGAACTTAGA 
TGTGTGCATC 

25 1441 AAAGGATAGA GATAAAAGAC 

GAGCAAAACA 

15 01 AAAGTAAGAC CACCGCACAG 
TGGAGGAGGA 

1561 GATATGAGGG ACAATTGGAG 
30 AATTGAACCA 

1621 TTAGGAGTAG CACCCACCAA 
AAGAGCAGTG 

1681 GGAATAGGAG CTTTGTTCCT 
GGGCGCAGCG 
35 1741 TCAATGACGC TGACGGTACA 

GCAGCAGAAC 

18 01 AATTTGCTGA GGGCTATTGA 
CTGGGGCATC 

18 61 AAGCAGCTCC AGGCAAGAAT 
40 ACAGCTCCTG 

1921 GGGATTTGGG GTTGCTCTGG 
GAATGCTAGT 

1981 TGGAGTAATA AATCTCTGGA 
GTGGGACAGA 
45 2 041 GAAATTAACA ATTACACAAG 

AAAC CAGCAA 

2101 GAAAAGAATG AACAAGAATT 
GAATTGGTTT 

2161 AACATAACAA ATTGGCTGTG 
50 AGGC TTGGT A 

2221 GGTTTAAGAA TAGTTTTTGC 
GGGATATTCA 

22 81 CCATTATCGT TTCAGACCCA 
CGAAGGAATA 
55 2 341 GAAGAAGAAG GTGGAGAGAG 

CGGATCGGCA 

24 01 CTGCGTGCGC CAATTCTGCA 
AAAGAAAAGG 



CCAAAATGTC GTAACAACTC CGCCCCATTG 
GAGGTCTATA TAAGCAGCGC GTTTTGCCTG 
AGCCTGGGAG CTCTCTGGCT AACTAGGGAA 
TTGAGTGCTT CAAGTAGTGT GTGCCCGTCT 
CAGACCCTTT TAGTCAGTGT GGAAAATCTC 
GCGAAAGGGA AACCAGAGGA GCTCTCTCGA 
GCAAGAGGCG AGGGGCGGCG ACTGGTGAGT 
AAGGAGAGAG ATGGGTGCGA GAGCGTCAGT 
GAAAAAATTC GGTTAAGGCC AGGGGGAAAG 
GCAAGCAGGG AGCTAGAACG ATTCGCAGTT 
TGTAGACAAA TACTGGGACA GCTACAACCA 
TCATTATATA ATACAGTAGC AACCCTCTAT 
ACCAAGGAAG CTTTAGACAA GATAGAGGAA 
CAAGCGGCCG GCCGCGCTGA TCTTCAGACC 
AAGTGAATTA TATAAATATA AAGTAGTAAA 
GGCAAAGAGA AGAGTGGTGC AGAGAGAAAA 
TGGGTTCTTG GGAGC AG CAG GAAGCAC TAT 
GGC CAGACAA TTATTGTCTG GTATAGTGCA 
GGCGCAACAG CATCTGTTGC AACTCACAGT 
CCTGGGTGTG GAAAG AT AC C TAAAGGATCA 
AAAACTCATT TGCACCACTG CTGTGCCTTG 
ACAGATTTGG AATCACACGA CCTGGATGGA 
CTTAATACAC TCCTTAATTG AAGAATCGCA 
ATTGGAATTA GATAAATGGG CAAGTTTGTG 
GTATATAAAA TTATTCATAA TGATAGTAGG 
TGTACTTTCT ATAGTGAATA GAGTTAGGCA 
CCTCCCAACC CCGAGGGGAC CCGACAGGCC 
AGACAGAGAC AGATCCATTC GATTAGTGAA 
GACAAATGGC AGTATTCATC CACAATTTTA 
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24 61 GGGGATTGGG GGGTACAGTG 
CAGACATACA 

2521 AACTAAAGAA TTACAAAAAC 
ATTACAGGGA 
5 25 81 CAGCAGAGAT CCAGTTTGGT 

CGCTAGCCGT 

2 641 TAATTAAGCC TCGAGGTCGA 
CAGCAGGTCG 

27 01 AGGGACCTAA TAACTTCGTA 
10 GTTCCAAGCT 

2 761 TAAGCGGCCG CGCCACCATG 
GTGGTGCCCA 

2 821 TCCTGGTCGA GCTGGACGGC 
GGCGAGGGCG 
15 2 8 81 AGGGCGATGC CACCTACGGC 

GGCAAGCTGC 

2 941 CCGTGCCCTG GCCCACCCTC 
TTCAGCCGCT 

3 0 01 ACCCCGACCA CATGAAGCAG 
20 GGCTACGTCC 

3 0 61 AGGAGCGCAC CATCTTCTTC 
GAGGTGAAGT 

3121 TCGAGGGCGA CACCCTGGTG 
AAGGAGGACG 
25 3181 GCAACATCCT GGGGCACAAG 

TATATCATGG 

3 241 CCGACAAGCA GAAGAACGGC 
ATCGAGGACG 

33 01 GCAGCGTGCA GCTCGCCGAC 
30 GGCCCCGTGC 

3 3 61 TGCTGCCCGA CAACCACTAC 
CCCAACGAGA 

3421 AGCGCGATCA CATGGTCCTG 
CTCGGCATGG 
35 34 81 ACGAGCTGTA CAAGATGCAT 

ATTCGTCGAG 

3 541 GGACCTAATA ACTTCGTATA 
AGGGTTCCGG 

3 601 TTCCACTAGG TACAATTCGA 
40 TTACAAAATT 

3661 TGTGAAAGAT TGACTGGTAT 
TGGATACGCT 

3 721 GCTTTAATGC CTTTGTATCA 
CTCCTCCTTG 
45 3 781 TATAAATCCT GGTTGCTGTC 

GCAACGTGGC 

3 841 GTGGTGTGCA CTGTGTTTGC 
CACCACCTGT 

3901 CAGCTCCTTT CCGGGACTTT 
50 ACTCATCGCC 

3 961 GCCTGCCTTG CCCGCTGCTG 
TTCCGTGGTG 

4 021 TTGTCGGGGA AATCATCGTC 
CTGGATTCTG 

55 4081 CGCGGGACGT CCTTCTGCTA 

TCCTTCCCGC 

4141 GGCCTGCTGC CGGCTCTGCG 
GACGAGTCGG 



CAGGGGAAAG AATAGTAGAC ATAATAGCAA 
AAATTACAAA AATTCAAAAT TTTCGGGTTT 
TAGTAC CGGG CCCGCTCTAG ACGGTTAACG 
CGGTATCGAT AAGCTCGCTT CACGAGATTC 
TAGCATACAT TATACGAAGT TATATTAAGG 
GTGAGCAAGG GCGAGGAGCT GTTCACCGGG 
GACGTAAACG GCCACAAGTT CAGCGTGTCC 
AAGCTGACCC TGAAGTTCAT CTGCACCACC 
GTGACCACCC TGACCTACGG CGTGCAGTGC 
CACGACTTCT TCAAGTCCGC CATGCCCGAA 
AAGGAGGACG GCAACTACAA GACCCGCGCC 
AACCGCATCG AGCTGAAGGG CATCGACTTC 
CTGGAGTACA ACTACAACAG CCACAACGTC 
ATCAAGGTGA ACTTCAAGAT CCGCCACAAC 
CACTACCAGC AGAACACCCC CATCGGCGAC 
CTGAGCACCC AGTCCGCCCT GAG C AAAGAC 
CTGGAGTTCG TGACCGCCGC CGGGATCACT 
GCCCCGGGAT GGCGCGCCAT GGATCCGCGA 
GCATACATTA TACGAAGTTA TACATGTTTA 
TATCAAGCTT ATCGATAATC AACCTCTGGA 
TCTTAACTAT GTTGCTCCTT TTACGCTATG 
TGCTATTGCT TCCCGTATGG CTTTCATTTT 
TCTTTATGAG GAGTTGTGGC CCGTTGTCAG 
TGACGCAACC CCCACTGGTT GGGGCATTGC 
CGCTTTCCCC CTCCCTATTG CCACGGCGGA 
GACAGGGGCT CGGCTGTTGG GCACTGACAA 
CTTTCCTTGG CTGCTCGCCT GTGTTGCCAC 
CGTCCCTTCG GCCCTCAATC CAGCGGACCT 
GCCTCTTCCG CGTCTTCGCC TTCGCCCTCA 
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42 01 ATCTCCCTTT GGGCCGCCTC 
ACCTAGAAAA 

42 61 ACATGGAGCA ATCACAAGTA 
CCTGGCTAGA 

5 4321 AG C AC AAG AG GAGGAGGAGG 

TAAGACCAAT 

43 81 GACTTACAAG GCAGCTGTAG 
GAC TGGAAGG 

4441 GCTAATTCAC TCCCAACGAA 
10 ACACACAAGG 

4 501 CTACTTCCCT GATTGGCAGA 
CACTGACCTT 

4561 TGGATGGTGC TACAAGCTAG 
CCAATGAAGG 
15 4 621 AG AG AAC AC C CGCTTGTTAC 

CGGAGAGAGA 

4681 AGTATTAGAG TGGAGGTTTG 
GAGAGCTGCA 

4 741 TCCGGACTGT ACTGGGTCTC 
20 TCTCTGGCTA 

4801 ACTAGGGAAC CCACTGCTTA 
AAGTAGTGTG 

4 861 TGCCCGTCTG TTGTGTGACT 
AGTCAGTGTG 
25 4 921 GAAAATCTCT AGCAGCATGT 

GTAAAAAGGC 

4 981 CGCGTTGCTG GCGTTTTTCC 
AAAATCGACG 

5041 CTCAAGTCAG AGGTGGCGAA 
30 TTCCCCCTGG 

5101 AAGCTCCCTC GTGCGCTCTC 
TGTCCGCCTT 

5161 TCTCCCTTCG GGAAGCGTGG 
TCAGTTCGGT 
35 5221 GTAGGT CGTT CGCTCCAAGC 

CCGACCGCTG 

52 81 CGCCTTATCC GGTAAC TAT C 
TATCGCCACT 

5341 GGCAGCAGCC ACTGGTAACA 
40 CTACAGAGTT 

54 01 CTTGAAGTGG TGGCCTAACT 
TCTGCGCTCT 

5461 GCTGAAGCCA GTTACCTTCG 
AACAAACCAC 
45 5521 CGCTGGTAGC GGTGGTTTTT 

AAAAAGGATC 

55 81 TCAAGAAGAT CCTTTGATCT 
AAAACTCACG 

5 641 TTAAGGGATT TTGGTCATGA 
50 TTTTAAATTA 

57 01 AAAATGAAGT TTTAAATCAA 
ACAGTTACCA 

5761 ATGCTTAATC AGTGAGGCAC 
CCATAGTTGC 
55 5 821 CTGACTCCCC GTCGTGTAGA 

GCCCCAGTGC 

5 881 TGCAATGATA CCGCGAGACC 
TAAACCAGCC 



CCCGCATCGA TACCGTCGAC CTCGATCGAG 
GCAATACAGC AGC TACCAAT GCTGATTGTG 
TGGGTTTTCC AGTCACACCT CAGGTACCTT 
ATCTTAGCCA CTTTTTAAAA GAAAAGGGGG 
GACAAGATAT CCTTGATCTG TGGATCTACC 
ACTACACACC AGGGCCAGGG AT CAGATATC 
TACCAGTTGA GCAAGAGAAG GTAGAAGAAG 
ACCCTGTGAG CCTGCATGGG ATGGATGACC 
ACAGCCGCCT AGCATTTCAT CACATGGCCC 
TCTGGTTAGA CCAGATCTGA GCCTGGGAGC 
AGCCTCAATA AAGCTTGCCT TGAGTGCTTC 
CTGGTAACTA GAGATCCCTC AGACCCTTTT 
GAGCAAAAGG CCAGCAAAAG GCCAGGAACC 
ATAGGCTCCG CCCCCCTGAC GAGCATCACA 
ACCCGACAGG ACTATAAAGA T AC CAGGCGT 
CTGTTCCGAC CCTGCCGCTT AC C GGAT AC C 
CGCTTTCTCA TAGCTCACGC TGTAGGTATC 
TGGGCTGTGT GCACGAACCC CCCGTTCAGC 
GTCTTGAGTC CAACCCGGTA AGACACGACT 
GGATTAGCAG AGC GAGGTAT GTAGGCGGTG 
ACGGCTACAC TAGAAGAACA GTATTTGGTA 
GAAAAAGAGT TGGTAGCTCT TGATCCGGCA 
TTGTTTGCAA GCAGCAGATT ACGCGCAGAA 
TTTCTACGGG GTCTGACGCT CAGTGGAACG 
GATTATCAAA AAGGATCTTC ACCTAGATCC 
TCTAAAGTAT ATATGAGTAA ACTTGGTCTG 
CTATCTCAGC GATCTGTCTA TTTCGTTCAT 
TAACTACGAT ACGGGAGGGC TTACCATCTG 
CACGCTCACC GGCTCCAGAT TTATCAGCAA 
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5941 AGCCGGAAGG GCCGAGCGCA 
TCCAGTCTAT 

60 01 TAATTGTTGC CGGGAAGCTA 
GCAACGTTGT 
5 6061 TGCCATTGCT ACAGGCATCG 

CATTCAGCTC 

6121 CGGTTCCCAA CGATCAAGGC 
AAGCGGTTAG 

6181 CTCCTTCGGT CCTCCGATCG 
10 CACTCATGGT 

6241 TATGGCAGCA CTGCATAATT 
TTTCTGTGAC 

63 01 TGGTGAGTAC TCAACCAAGT 
GTTGCTCTTG 

15 63 61 CCCGGCGTCA ATACGGGATA 

TGCTCATCAT 

6421 TGGAAAACGT TCTTCGGGGC 
GATCCAGTTC 

64 81 GATGTAACCC ACTCGTGCAC 
20 CCAGCGTTTC 

6541 TGGGTG AG C A AAAACAGGAA 
CGACACGGAA 

6601 ATGTTGAATA CTCATACTCT 
AGGGTTATTG 
25 6661 TCTCATGAGC GGATACATAT 

GGGTTCCGCG 

6721 CACATTTCCC CGAAAAGTGC 

// 

30 (SEQ ID NO: 3) 



GAAGTGGTCC TGCAACTTTA TCCGCCTCCA 
GAGTAAGTAG TTCGCCAGTT AATAGTTTGC 
TGGTGTCACG CTCGTCGTTT GGTATGGCTT 
GAGTTACATG ATCCCCCATG TTGTGCAAAA 
TTGTCAGAAG TAAGTTGGCC GGAGTGTTAT 
CTCTTACTGT CATGGCATCC GTAAGATGCT 
CATTCTGAGA ATAGTGTATG CGGCGACCGA 
ATACCGCGCC ACATAGCAGA ACTTTAAAAG 
GAAAACTCTC AAGGATCTTA CCGCTGTTGA 
CCAACTGATC TTCAGCATCT TTTACTTTCA 
GGCAAAATGC CGCAAAAAAG GGAATAAGGG 
TCCTTTTTCA ATATTATTGA AGCATTTATC 
TTGAATGTAT TTAGAAAAAT AAACAAATAG 
CACCTGAC 
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[00347] pLL3.2 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



LOCUS 
2002 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES 

promoter 



misc recomb 



PLENTILOX 



67 06 BP DS-DNA CIRCULAR SYN 



23-JAN- 



gene 

rep_origin 

misc_recomb 

LTR 

misc feature 



misc feature 



Location/Qualifiers 
212 . . 816 

/note="CMV promoter/enhancer 1" 

3506. .3539 

/note="LoxP" 

5713 . .6573 

/note="AmpR" 

4895. .5568 

/note="pUC n 

2710. .2745 

/note= M LoxP " 

835 . . 1509 

/note="5' HIV R-U5-del gag (HIV NL4 -3/454 -1126 ) » 
1539. .2396 

/note="HIV RRE (HIV NL4 -3/7622 -8459) " 
2422 . .2599 



misc feature 



LTR 



gene 



/note="HIV Flap" 
3594.. 4183 
/note= " WRE element " 
4203 . .4892 
/note="3 1 SIN LTR" 
2772 . .3452 
/note="dsRed2 " 
1755 A 1638 C 1722 G 



1591 T 



0 OTHER 



BASE COUNT 
ORIGIN 

1 GTCGACGGAT CGGGAGATCT CCCGATCCCC TATGGTGCAC TCTCAGTACA 
ATCTGCTCTG 

61 ATGCCGCATA GTTAAGCCAG TATCTGCTCC CTGCTTGTGT GTTGGAGGTC 
GCTGAGTAGT 

121 GCGCGAGCAA AATTTAAGCT ACAACAAGGC AAGGCTTGAC CGACAATTGC 
ATGAAGAATC 

181 TGCTTAGGGT TAGGCGTTTT GCGCTGCTTC GCGATGTACG GGCCAGATAT 
ACGCGTTGAC 

241 ATTGATTATT GACTAGTTAT TAATAGTAAT CAATTACGGG GTCATTAGTT 
CATAGCCCAT 

3 01 ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA 
CCGCCCAACG 

3 61 ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT AGTAACGCCA 
ATAGGGACTT 

421 TCCATTGACG TCAATGGGTG GAGTATTTAC GGTAAACTGC CCACTTGGCA 
GTACATCAAG 

4 81 TGTATCATAT GCCAAGTACG CCCCCTATTG ACGTCAATGA CGGTAAATGG 
CCCGCCTGGC 

541 ATTATGCCCA GTACATGACC TTATGGGACT TTCCTACTTG GCAGTACATC 
TACGTATTAG 

601 TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT CAATGGGCGT 
GGATAGCGGT 

661 TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT CAATGGGAGT 
TTGTTTTGGC 
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721 ACCAAAATCA ACGGGACTTT 
ACGCAAATGG 

7 81 GCGGTAGGCG TGTACGGTGG 
TACTGGGTCT 
5 841 CTCTGGTTAG AC CAGATCTG 

CCCACTGCTT 

901 AAGCCTCAAT AAAGCTTGCC 
GTTGTGTGAC 

961 TCTGGTAACT AGAGATCCCT 
10 TAGCAGTGGC 

1021 GCCCGAACAG GGACTTGAAA 
CGCAGGACTC 

10 81 GGCTTGCTGA AGCGCGCACG 
ACGCCAAAAA 
15 1141 TTTTGACTAG CGGAGGCTAG 

ATTAAGCGGG 

12 01 GGAGAATTAG ATCGCGATGG 
AAAAAATATA 

12 61 AATTAAAACA TATAGTATGG 
20 AATCCTGGCC 

13 21 TGTTAGAAAC ATCAGAAGGC 
TCCCTTCAGA 

13 81 CAGGATCAGA AGAACTTAGA 
TGTGTGCATC 
25 1441 AAAGGATAGA GATAAAAGAC 

GAGCAAAACA 

15 01 AAAGTAAGAC CACCGCACAG 
TGGAGGAGGA 

15 61 GATATGAGGG ACAATTGGAG 
30 AATTGAAC C A 

1621 TTAGGAGTAG CACCCACCAA 
AAGAGCAGTG 

16 81 GGAATAGGAG CTTTGTTCCT 
GGGCGCAGCG 

35 1741 TCAATGACGC TGACGGTACA 

GCAGCAGAAC 

18 01 AATTTGCTGA GGGCTATTGA 
CTGGGGCATC 

18 61 AAGCAGCTCC AGGCAAGAAT 
40 ACAGCTCCTG 

1921 GGGATTTGGG GTTGCTCTGG 
GAATGCTAGT 

1981 TGGAGTAATA AATCTCTGGA 
GTGGGACAGA 
45 2 041 GAAATTAACA ATTACACAAG 

AAACCAGCAA 

2101 GAAAAGAATG AACAAGAATT 
GAATTGGTTT 

2161 AACATAACAA ATTGGCTGTG 
50 AGGCTTGGTA 

22 21 GGTTTAAGAA TAGTTTTTGC 
GGGATATTCA 

22 81 CCATTATCGT TTCAGACCCA 
CGAAGGAATA 
55 2 341 GAAGAAGAAG GTGGAGAGAG 

CGGATCGGCA 

24 01 CTGCGTGCGC CAATTCTGCA 
AAAGAAAAGG 



CCAAAATGTC GTAACAACTC CGCCCCATTG 
GAGGTCTATA TAAGCAGCGC GTTTTGCCTG 
AGCCTGGGAG CTCTCTGGCT AACTAGGGAA 
TTGAGTGCTT CAAGTAGTGT GTGCCCGTCT 
CAGACCCTTT TAGTCAGTGT GGAAAATCTC 
GCGAAAGGGA AAC CAGAGGA GCTCTCTCGA 
GCAAGAGGCG AGGGGCGGCG ACTGGTGAGT 
AAGGAGAGAG ATGGGTGCGA GAGCGTCAGT 
GAAAAAATTC GGTTAAGGCC AGGGGGAAAG 
GCAAGCAGGG AGCTAGAACG ATTCGCAGTT 
TGTAGACAAA TACTGGGACA GCTACAACCA 
TCATTATATA ATACAGTAGC AACCCTCTAT 
AC C AAGGAAG CTTTAGACAA GATAGAGGAA 
CAAGCGGCCG GCCGCGCTGA TCTTCAGACC 
AAGTGAATTA TATAAATATA AAGTAGTAAA 
GGCAAAGAGA AGAGTGGTGC AGAGAGAAAA 
TGGGTTCTTG GGAGCAGCAG GAAGCACTAT 
GGCCAGACAA TTATTGTCTG GTATAGTGCA 
GGCGCAACAG CATCTGTTGC AACTCACAGT 
CCTGGCTGTG GAAAGATACC TAAAGGATCA 
AAAACTCATT TGCACCACTG CTGTGCCTTG 
ACAGATTTGG AATCACACGA CCTGGATGGA 
CTTAATACAC TCCTTAATTG AAGAATCGCA 
ATTGGAATTA GATAAATGGG CAAGTTTGTG 
GTATATAAAA TTATTCATAA TGATAGTAGG 
TGTACTTTCT ATAGTGAATA GAGTTAGGCA 
CCTCCCAACC CCGAGGGGAC CCGACAGGCC 
AGACAGAGAC AGATCCATTC GATTAGTGAA 
G AC AAATGG C AGTATTCATC CACAATTTTA 
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2461 GGGGATTGGG GGGTACAGTG 
CAGACATACA 

2521 AACTAAAGAA TTACAAAAAC 
ATTACAGGGA 

25 81 CAGCAGAGAT CCAGTTTGGT 
CGCTAGCCGT 

2 641 TAATTAAGCC TCGAGGTGGA 
CAGCAGGTCG 

27 01 AGGGAC CTAA TAACTTCGTA 
GTTCCAAGCT 

2 761 TAAGCGGCCG CGCCACCATG 
ATGCGCTTCA 

2 821 AGGTGCGCAT GGAGGGCACC 
GAGGGCGAGG 

2 881 GCCGCCCCTA CGAGGGCCAC 
GGCCCCCTGC 

2 941 CCTTCGCCTG GGACATCCTG 
TACGTGAAGC 

3 0 01 ACCCCGCCGA CATCCCCGAC 
AAGTGGGAGC 

3 0 61 GCGTGATGAA CTTCGAGGAC 
TCCCTGCAGG 

3121 ACGGCTGCTT CATCTACAAG 
GACGGCCCCG 

3181 TGATGCAGAA GAAGACCATG 
CCCCGCGACG 

3241 GCGTGCTGAA GGGGGAGACC 
CACTACCTGG 

33 01 TGGAGTTCAA GTCCATCTAC 
TACTACTACG 

33 61 TGGACGCCAA GCTGGACATC 
GAGCAGTACG 

3421 AGCGCACCGA GGGCCGCCAC 
CGCGCCATGG 

34 81 ATCCGCGAAT TCGTCGAGGG 
CGAAGTTATA 

3 541 CATGTTTAAG GGTTCCGGTT 
CGATAATCAA 

3 601 CCTCTGGATT ACAAAATTTG 
TGCTCCTTTT 

3661 ACGCTATGTG GATACGCTGC 
CCGTATGGCT 

3721 TTCATTTTCT CCTCCTTGTA 
GTTGTGGCCC 

3781 GTTGTCAGGC AACGTGGCGT 
CACTGGTTGG 

3 841 GGCATTGCCA CCACCTGTCA 
CCCTATTGCC 

3 901 ACGGCGGAAC TCATCGCCGC 
GCTGTTGGGC 

3 961 ACTGACAATT CCGTGGTGTT 
GCTCGCCTGT 

4 021 GTTGCCACCT GGATTCTGCG 
CCTCAATCCA 

4 081 GCGGACCTTC CTTCCCGCGG 
TCTTCGCCTT 

4141 CGCCCTCAGA CGAGTCGGAT 
CCGTCGACCT 



CAGGGGAAAG AATAGTAGAC ATAATAGCAA 
AAATTACAAA AATTCAAAAT TTTCGGGTTT 
TAGTACCGGG CCCGCTCTAG ACGGTTAACG 
CGGTATCGAT AAGCTCGCTT CACGAGATTC 
TAGCATACAT TATACGAAGT TATATTAAGG 
GCCTCCTCCG AGAACGTCAT CACCGAGTTC 
GTGAACGGCC ACGAGTTCGA GATCGAGGGC 
AACACCGTGA AGCTGAAGGT GACCAAGGGC 
TCCCCCCAGT TCCAGTACGG CTCCAAGGTG 
TACAAGAAGC TGTCCTTCCC CGAGGGCTTC 
GGCGGCGTGG CGACCGTGAC CCAGGACTCC 
GTGAAGTTCA TCGGCGTGAA CTTCCCCTCC 
GGCTGGGAGG CCTCCACCGA GCGCCTGTAC 
CACAAGGCCC TGAAGCTGAA GGACGGCGGC 
ATGGCCAAGA AGCCCGTGCA GCTGCCCGGC 
ACCTCCCACA ACGAGGACTA CACCATCGTG 
CACCTGTTCC TGATGCATGC CCCGGGATGG 
ACCTAATAAC TTCGTATAGC ATACATTATA 
CCACTAGGTA CAATTCGATA TCAAGCTTAT 
TGAAAGATTG ACTGGTATTC TTAACTATGT 
TTTAATGCCT TTGTATCATG CTATTGCTTC 
TAAATCCTGG TTGCTGTCTC TTTATGAGGA 
GGTGTGCACT GTGTTTGCTG ACGCAACCCC 
GCTCCTTTCC GGGACTTTCG CTTTCCCCCT 
CTGCCTTGCC CGCTGCTGGA CAGGGGCTCG 
GTCGGGGAAA TCATCGTCCT TTCCTTGGCT 
CGGGACGTCC TTCTGCTACG TCCCTTCGGC 
CCTGCTGCCG GCTCTGCGGC CTCTTCCGCG 
CTCCCTTTGG GCCGCCTCCC CGCATCGATA 



Page 120 of 171 



WO 2004/022722 



PCT/US2003/028111 



42 01 CGATCGAGAC CTAGAAAAAC 
CTACCAATGC 

42 61 TGATTGTGCC TGGCTAGAAG 
TCACACCTCA 

5 4321 GGTACCTTTA AGACCAATGA 

TTTTAAAAGA 

43 81 AAAGGGGGGA CTGGAAGGGC 
TTGATCTGTG 

4441 GATCTACCAC ACACAAGGCT 
10 GGCCAGGGAT 

45 01 CAGATATCCA CTGACCTTTG 
AAGAGAAGGT 

4561 AGAAGAAGCC AATGAAGGAG 
TGCATGGGAT 
15 4 621 GGATGACCCG GAGAGAGAAG 

CATTTCATCA 

4681 CATGGCCCGA GAGCTGCATC 
AGATCTGAGC 

4 741 CTGGGAGCTC TCTGGCTAAC 
20 GCTTGCCTTG 

48 01 AGTGCTTCAA GTAGTGTGTG 
GATCCCTCAG 

4 861 ACCCTTTTAG TCAGTGTGGA 
AGCAAAAGGC 
25 4 921 CAGGAACCGT AAAAAGGCCG 

CCCCTGACGA 

4 981 GCATCACAAA AATCGACGCT 
TATAAAGATA 

5 041 CCAGGCGTTT CCCCCTGGAA 
30 TGCCGCTTAC 

5101 CGGATACCTG TCCGCCTTTC 
GCTCACGCTG 

5161 TAGGTATCTC AGTTCGGTGT 
ACGAACCCCC 
35 5221 CGTTCAGCCC GACCGCTGCG 

ACCCGGTAAG 

52 81 ACACGACTTA TCGCCACTGG 
C GAGGT ATGT 

5341 AGGCGGTGCT ACAGAGTTCT 
40 GAAGAACAGT 

5401 ATTTGGTATC TGCGCTCTGC 
GTAGCTCTTG 

54 61 ATCCGGCAAA CAAACCACCG 
AGCAGATTAC 
45 5521 GCGCAGAAAA AAAGGATCTC 

CTGACGCTCA 

5581 GTGGAACGAA AACTCACGTT 
GGATCTTCAC 

5 641 CTAGATCCTT TTAAATTAAA 
50 ATGAGTAAAC 

5701 TTGGTCTGAC AGTTACCAAT 
TCTGTCTATT 

5 761 TCGTTCATCC ATAGTTGCCT 
GGGAGGGCTT 
55 5 821 ACCATCTGGC CCCAGTGCTG 

CTCCAGATTT 

5 881 ATCAGCAATA AACCAGCCAG 
CAACTTTATC 



ATGGAGCAAT CACAAGTAGC AAT AC AG C AG 
CACAAGAGGA GGAGGAGGTG GGTTTTCCAG 
CTTACAAGGC AGCTGTAGAT CTTAGCCACT 
TAATTCACTC CCAACGAAGA CAAGATATCC 
ACTTCCCTGA TTGGCAGAAC TAG AC AC C AG 
GATGGTGCTA CAAGCTAGTA CCAGTTGAGC 
AGAACACCCG CTTGTTACAC CCTGTGAGCC 
TATTAGAGTG GAGGTTTGAC AGCCGCCTAG 
CGGACTGTAC TGGGTCTCTC TGGTTAGACC 
TAGGGAACCC ACTGCTTAAG CCTCAATAAA 
CCCGTCTGTT GTGTGACTCT GGTAACTAGA 
AAATCTCTAG CAGCATGTGA GCAAAAGGCC 
CGTTGCTGGC GTTTTTCCAT AGGCTCCGCC 
CAAGTCAGAG GTGGCGAAAC CCGACAGGAC 
GCTCCCTCGT GCGCTCTCCT GTTCCGACCC 
TCCCTTCGGG AAGCGTGGCG CTTTCTCATA 
AGGTCGTTCG CTCCAAGCTG GGCTGTGTGC 
CCTTATCCGG TAACTATCGT CTTGAGTC C A 
CAGCAGCCAC TGGTAACAGG ATTAGCAGAG 
TGAAGTGGTG GCCTAACTAC GGCTACACTA 
TGAAGCCAGT TACCTTCGGA AAAAGAGTTG 
CTGGTAGCGG TGGTTTTTTT GTTTGCAAGC 
AAGAAGATCC TTTGATCTTT TCTACGGGGT 
AAGGGATTTT GGTCATGAGA TTATCAAAAA 
AATGAAGTTT TAAATCAATC TAAAGTATAT 
GCTTAATCAG TGAGGCACCT ATCTCAGCGA 
GACTCCCCGT CGTGTAGATA ACTACGATAC 
CAATGATACC GCGAGACCCA CGCTCACCGG 
CCGGAAGGGC CGAGCGCAGA AGTGGTCCTG 
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5941 CGCCTCCATC CAGTCTATTA 
CGCCAGTTAA 

60 01 TAGTTTGCGC AACGTTGTTG 
CGTCGTTTGG 
5 6 0 61 TATGGCTTCA TTCAGCTCCG 

CCCCCATGTT 

6121 GTGCAAAAAA GCGGTTAGCT 
AGTTGGCCGC 

6181 AGTGTTATCA CTCATGGTTA 
10 TGCCATCCGT 

6241 AAGATGCTTT TCTGTGACTG 
AGTGTATGCG 

63 01 GCGACCGAGT TGCTCTTGCC 
ATAGCAGAAC 

15 63 61 TTTAAAAGTG CTCATCATTG 

GGATCTTACC 

6421 GCTGTTGAGA TCCAGTTCGA 
CAGCATCTTT 

64 81 TACTTTCACC AGCGTTTCTG 
20 CAAAAAAGGG 

6541 AATAAGGGCG ACACGGAAAT 
ATTATTGAAG 

6601 CATTTATCAG GGTTATTGTC 
AGAAAAATAA 
25 6661 ACAAATAGGG GTTCCGCGCA 

// 

(SEQ ID NO: 4) 



ATTGTTGCCG GGAAGCTAGA GTAAGTAGTT 
CCATTGCTAC AGGCATCGTG GTGTCACGCT 
GTTCCCAACG ATCAAGGCGA GTTACATGAT 
CCTTCGGTCC TCCGATCGTT GTCAGAAGTA 
TGGCAGCACT GCATAATTCT CTTACTGTCA 
GTGAGTACTC AACCAAGTCA TTCTGAGAAT 
CGGCGTCAAT ACGGGATAAT ACCGCGCCAC 
GAAAACGTTC TTCGGGGCGA AAACTCTCAA 
TGTAACCCAC TCGTGCACCC AACTGATCTT 
GGTGAGCAAA AACAGGAAGG CAAAATGCCG 
GTTGAATACT CATACTCTTC CTTTTTCAAT 
TCATGAGCGG ATACATATTT GAATGTATTT 
CATTTCCCCG AAAAGTGCCA CCTGAC 
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[00348] pLL3.3 

LOCUS PLENTILOX 
2002 

DEFINITION - 
ACCESSION 
KEYWORDS 
SOURCE 
FEATURES 

promoter 



724 8 BP DS-DNA CIRCULAR SYN 



23- JAN - 



gene 

rep_origin 
misc_recomb 
misc_recomb 
LTR 

mi s c_f eature 
mis cofeature 
mis cofeature 
LTR 
frag 

promoter 

1815 A 



Location/Qualifiers 
212 . . 816 
/note="CMV promoter/enhancer 1" 
6255. .7115 
/note= !, AmpR" 
5437 . .6110 
/note="pUC" 
3931. .3966 
/note="Lox 1" 
4048 . .4081 
/note="Lox2 » 
835 . . 1509 

/note="5' HIV R-U5-del gag (HIV NL4-3/454 -1126) " 
1539. .2396 

/note="HIV RRE (HIV NL4 -3/7622 -8459) » 
2422 . .2599 
/note="HIV Flap" 
4136. .4725 
/note= M WRE element" 
4745 . .5434 
/note="3 » SIN LTR" 
2627 . .3847 

/note="13 to 1233 of pUB6/V5 -HisA" 
2632 . .3841 
/note="UbC promoter" 

1695 C 1947 G 1791 T 0 OTHER 



BASE COUNT 
ORIGIN 

1 GTCGACGGAT CGGGAGATCT CCCGATCCCC TATGGTGCAC TCTCAGTACA 
ATCTGCTCTG 

61 ATGCCGCATA GTTAAGCCAG TATCTGCTCC CTGCTTGTGT GTTGGAGGTC 
GCTGAGTAGT 

121 GCGCGAGCAA AATTTAAGCT ACAACAAGGC AAGGCTTGAC CGACAATTGC 
ATGAAGAATC 

181 TGCTTAGGGT TAGGCGTTTT GCGCTGCTTC GCGATGTACG GGCCAGATAT 
ACGCGTTGAC 

241 ATTGATTATT GACTAGTTAT TAATAGTAAT CAATTACGGG GTCATTAGTT 
CATAGC C CAT 

3 01 ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA 
CCGCCCAACG 

361 ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGT T C C CAT AGTAACGCCA 
ATAGGGACTT 

421 TCCATTGACG TCAATGGGTG GAGTATTTAC GGTAAACTGC CCACTTGGCA 
GTACATCAAG 

481 TGTATCATAT GCCAAGTACG CCCCCTATTG ACGTCAATGA CGGTAAATGG 
CCCGCCTGGC 

541 ATTATGCCCA GTACATGACC TTATGGGACT TTCCTACTTG GCAGTACATC 
TACGTATTAG 

601 TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT CAATGGGCGT 
GGATAGCGGT 
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661 TTGACTCACG 
TTGTTTTGGC 

721 ACCAAAATCA 
ACGCAAATGG 

781 GCGGTAGGCG 
TACTGGGTCT 

841 CTCTGGTTAG 
CCCACTGCTT 

901 AAGCCTCAAT 
GTTGTGTGAC 

961 TCTGGTAACT 
TAGCAGTGGC 

1021 GCCCGAACAG 
CGCAGGACTC 

1081 GGCTTGCTGA 
ACGCCAAAAA 

1141 TTTTGACTAG 
ATTAAGCGGG 

12 01 G G AGAATT AG 
AAAAAATATA 

12 61 AATTAAAACA 
AATCCTGGCC 

1321 TGTTAGAAAC 
TCCCTTCAGA 

13 81 CAGGATCAGA 
TGTGTGCATC 

1441 AAAGGATAGA 
GAGCAAAACA 

15 01 AAAGTAAGAC 
TGGAGGAGGA 

15 61 GATATGAGGG 
AATTGAACCA 

1621 TTAGGAGTAG 
AAGAGCAGTG 

1681 GGAATAGGAG 
GGGCGCAGCG 

1741 TCAATGACGC 
GCAGCAGAAC 

1801 AATTTGCTGA 
CTGGGGCATC 

1861 AAGCAGCTCC 
ACAGCTCCTG 

1921 GGGATTTGGG 
GAATGCTAGT 

1981 TGGAGTAATA 
GTGGGACAGA 

2 041 GAAATTAACA 
AAACCAGCAA 

2101 GAAAAGAATG 
GAATTGGTTT 

2161 AACATAACAA 
AGGCTTGGTA 

2221 GGTTTAAGAA 
GGGATATTCA 

2281 CCATTATCGT 
CGAAGGAATA 

2341 GAAGAAGAAG 
CGGATCGGCA 



GGGATTTCCA AGTCTCCACC 
ACGGGACTTT CCAAAATGTC 
TGTACGGTGG GAGGTCTATA 
ACCAGATCTG AGCCTGGGAG 
AAAGCTTGCC TTGAGTGCTT 
AGAGATCCCT CAGACCCTTT 
GGACTTGAAA GCGAAAGGGA 
AGCGCGCACG GCAAGAGGCG 
CGGAGGCTAG AAGGAGAGAG 
ATCGCGATGG GAAAAAATTC 
TATAGTATGG GCAAGCAGGG 
ATCAGAAGGC TGTAGACAAA 
AGAACTTAGA TCATTATATA 
GATAAAAGAC ACCAAGGAAG 
CACCGCACAG CAAGCGGCCG 
ACAATTGGAG AAGTGAATTA 
CACCCACCAA GGCAAAGAGA 
CTTTGTTCCT TGGGTTCTTG 
TGACGGTACA GGCCAGACAA 
GGGCTATTGA GGCGCAACAG 
AGGCAAGAAT CCTGGCTGTG 
GTTGCTCTGG AAAACTCATT 
AATCTCTGGA ACAGATTTGG 
ATTACACAAG CTTAATACAC 
AACAAGAATT ATTGGAATTA 
ATTGGCTGTG GTATATAAAA 
TAGTTTTTGC TGTACTTTCT 
TTCAGACC C A CCTCCCAACC 
GTGGAGAGAG AGACAGAGAC 



CCATTGACGT CAATGGGAGT 
GTAACAACTC CGCCCCATTG 
TAAGCAGCGC GTTTTGCCTG 
CTCTCTGGCT AACTAGGGAA 
CAAGTAGTGT GTGCCCGTCT 
TAGTCAGTGT GGAAAATCTC 
AACCAGAGGA GCTCTCTCGA 
AGGGGCGGCG ACTGGTGAGT 
ATGGGTGCGA GAGCGTCAGT 
GGTTAAGGCC AGGGGGAAAG 
AGCTAGAACG ATTCGCAGTT 
TACTGGGACA GCTACAACCA 
ATACAGTAGC AACCCTCTAT 
CTTTAGACAA GATAGAGGAA 
GCCGCGCTGA TCTTCAGACC 
TATAAATATA AAGTAGTAAA 
AGAGTGGTGC AGAGAGAAAA 
GGAGCAGCAG GAAGCACTAT 
TTATTGTCTG GTATAGTGCA 
CATCTGTTGC AACTCACAGT 
GAAAGATACC TAAAGGATCA 
TGCACCACTG CTGTGCCTTG 
AATCACACGA CCTGGATGGA 
TCCTTAATTG AAGAATCGCA 
GATAAATGGG CAAGTTTGTG 
TTATTCATAA TGATAGTAGG 
ATAGTGAATA GAGTTAGGCA 
CCGAGGGGAC CCGACAGGCC 
AGATCCATTC GATTAGTGAA 
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24 01 CTGCGTGCGC CAATTCTGCA 
AAAGAAAAGG 

24 61 GGGGATTGGG GGGTACAGTG 
CAGACATACA 
5 2521 AACTAAAGAA TTACAAAAAC 

ATTACAGGGA 

2581 CAGCAGAGAT CCAGTTTGGT 
TGGCCTCCGC 

2 641 GCCGGGTTTT GGCGCCTCCC 
10 GCCACGTCAG 

2 701 ACGAAGGGCG CAGGAGCGTC 
CGGCCCGCTG 

2 761 CTCATAAGAC TCGGCCTTAG 
GACGGGACTT 
15 2 821 GGGTGACTCT AGGGCACTGG 

AAAAGTAGTC 

2 881 CCTTCTCGGC GATTCTGCGG 
GATTATATAA 

2 941 GGACGCGCCG GGTGTGGCAC 
20 CGCGGTTCTT 

3 001 GTTTGTGGAT CGCTGTGATC 
GGCCGGGGCT 

3 0 61 TTCGTGGCCG CCGGGCCGCT 
CAAGGGCTGT 
25 3121 AGTCTGGGTC CGCGAGCAAG 

CAGCAAAATG 

3181 GCGGCTGTTC CCGAGTCTTG 
GGTCGTTGAA 

3241 ACAAGGTGGG GGGCATGGTG 
30 CGCTAATGCG 

3 3 01 GGAAAGCTCT TATTCGGGTG 
GACGTGAAGT 

3361 TTGTCACTGA CTGGAGAACT 
TATGCGGTGC 
35 3421 CGTTGGGCAG TGCACCCGTA 

TGACGTCACC 

34 81 CGTTCTGTTG GCTTATAATG 
GGTAGGCTTT 

3 541 TCTCCGTCGC AGGACGCAGG 
40 GACAGGCGCC 

3 601 GGACCTCTGG TGAGGGGAGG 
TTTTATGTAC 

3 661 CTATCTTCTT AAGTAGCTGA 
TGGCGAGTGT 
45 3 721 GTTTTGTGAA GTTTTTTAGG 

TATGTAATTT 

3 781 TCAGTGTTAG ACTAGTAAAT 
TTTTGTTAGA 

3 841 CGAAGCTAAC GCGCTAGCCG 
50 TAAGCTCGCT 

3 901 TCACGAGATT CCAGCAGGTC 
TTATACGAAG 

3 961 TTATATTAAG GGTTCCAAGC 
GGCGCGCCAT 

55 4 021 GGATCCGCGA ATTCGTCGAG 

TACGAAGTTA 

4 081 TACATGTTTA AGGGTTCCGG 
ATCGATAATC 



GACAAATGGC AGTATTCATC CACAATTTTA 
CAGGGGAAAG AATAGTAGAC ATAATAGCAA 
AAATTACAAA AATT CAAAAT TTTCGGGTTT 
TAGTAC CGGG CCCGCTCTAG ACGGTTGATC 
GCGGGCGCCC CCCTCCTCAC GGCGAGCGCT 
CTGATCCTTC CGCCCGGACG CTCAGGACAG 
AACCCCAGTA TCAGCAGAAG GACATTTTAG 
TTTTCTTTCC AGAGAGCGGA ACAGGCGAGG 
AGGGATCTCC GTGGGGCGGT GAACGCCGAT 
AGCTAGTTCC GTCGCAGCCG GGATTTGGGT 
GTCACTTGGT GAGTAGCGGG CTGCTGGGCT 
CGGTGGGACG GAAGCGTGTG GAGAGAC CGC 
GTTGCCCTGA AC TGGGGGTT GGGGGGAGCG 
AATGGAAGAC GCTTGTGAGG CGGGCTGTGA 
GGCGGCAAGA ACCCAAGGTC TTGAGGCCTT 
AGATGGGCTG GGG C AC CAT C TGGGGACCCT 
CGGTTTGTCG TCTGTTGCGG GGGCGGCAGT 
CCTTTGGGAG CGCGCGCCCT CGTCGTGTCG 
CAGGGTGGGG CCACCTGCCG GTAGGTGTGC 
GTTCGGGCCT AGGGTAGGCT CTCCTGAATC 
GATAAGTGAG GCGTCAGTTT CTTTGGTCGG 
AGCTCCGGTT TTGAACTATG CGCTCGGGGT 
CACCTTTTGA AATGTAATCA TTTGGGTCAA 
TGTCCGCTAA ATTCTGGCCG TTTTTGGCTT 
TTAATTAAGC CTCGAGGTCG ACGGTATCGA 
GAGGGACCTA ATAACTTCGT ATAGCATACA 
TTAAGCGGCC GCCGATGCAT GCCCCGGGAT 
GGAC CTAAT A ACTTCGTATA GCATACATTA 
TTCCACTAGG TACAATTCGA TATCAAGCTT 
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4141 AACCTCTGGA 
GTTGCTCCTT 

42 01 TTACGCTATG 
TCCCGTATGG 

4261 CTTTCATTTT 
GAGTTGTGGC 

4321 C CGTTGTC AG 
CCCACTGGTT 

43 81 GGGGCATTGC 
CTCCCTATTG 

4441 CCACGGCGGA 
CGGCTGTTGG 

45 01 GCACTGACAA 
CTGCTCGCCT 

4561 GTGTTGCCAC 
GCCCTCAATC 

4621 CAGCGGACCT 
CGTCTTCGCC 

4 681 TTCGCCCTCA 
TACCGTCGAC 

4 741 CTCGATCGAG 
AGCTACCAAT 

4801 GCTGATTGTG 
AGTCACACCT 

4 861 CAGGTACCTT 
CTTTTTAAAA 

4 921 GAAAAGGGGG 
CCTTGATCTG 

4981 TGGATCTACC 
AGGGCCAGGG 

5041 ATCAGATATC 
GCAAGAGAAG 

5101 GTAGAAGAAG 
CCTGCATGGG 

5161 ATGGATGACC 
AGCATTTCAT 

52 21 CACATGGCCC 
CCAGATCTGA 

52 81 GCCTGGGAGC 
AAGCTTGCCT 

5341 TGAGTGCTTC 
GAGATCCCTC 

54 01 AGACCCTTTT 
CCAGCAAAAG 

5461 GCCAGGAACC 
CCCCCCTGAC 

5521 GAGCATCACA 
ACTATAAAGA 

5581 TACCAGGCGT 
CCTGCCGCTT 

5 641 ACCGGATACC 
TAGCTCACGC 

57 01 TGTAGGTATC 
GCACGAACCC 

5 761 CCCGTTCAGC 
CAACCCGGTA 

5821 AGACACGACT 
AGCGAGGTAT 



TTACAAAATT TGTGAAAGAT 
TGGATACGCT GCTTTAATGC 
CTCCTCCTTG TATAAATCCT 
GCAACGTGGC GTGGTGTGCA 
CACCACCTGT CAGCTCCTTT 
ACTCATCGCC GCCTGCCTTG 
TTCCGTGGTG TTGTCGGGGA 
CTGGATTCTG CGCGGGACGT 
TCCTTCCCGC GGCCTGCTGC 
GACGAGTCGG ATCTCCCTTT 
ACCTAGAAAA ACATGGAGCA 
CCTGGCTAGA AGCACAAGAG 
TAAGACCAAT GACTTACAAG 
GACTGGAAGG GCTAATTCAC 
ACACACAAGG CTACTTCCCT 
CACTGACCTT TGGATGGTGC 
CCAATGAAGG AGAGAACACC 
CGGAGAGAGA AGTATTAGAG 
GAGAGCTGCA TCCGGACTGT 
TCTCTGGCTA ACTAGGGAAC 
AAGTAGTGTG TGCCCGTCTG 
AGTCAGTGTG GAAAATCTCT 
GTAAAAAGGC CGCGTTGCTG 
AAAATCGACG CTCAAGTCAG 
TTCCCCCTGG AAGCTCCCTC 
TGTCCGCCTT TCTCCCTTCG 
TCAGTTCGGT GTAGGTCGTT 
CCGACCGCTG CGCCTTATCC 
TATCGCCACT GGCAGCAGCC 



TGAC TGGTAT TCTTAACTAT 
CTTTGTATCA TGCTATTGCT 
GGTTGCTGTC TCTTTATGAG 
CTGTGTTTGC TGACGCAACC 
CCGGGACTTT CGCTTTCCCC 
CCCGCTGCTG GACAGGGGCT 
AATCATCGTC CTTTCCTTGG 
CCTTCTGCTA CGTCCCTTCG 
CGGCTCTGCG GCCTCTTCCG 
GGGCCGCCTC CCCGCATCGA 
AT C AC AAGT A GCAATACAGC 
GAGGAGGAGG TGGGTTTTCC 
GCAGCTGTAG ATCTTAGCCA 
TCCCAACGAA GACAAGATAT 
GATTGGCAGA ACTACACACC 
TACAAGCTAG TACCAGTTGA 
CGCTTGTTAC ACCCTGTGAG 
TGGAGGTTTG ACAGCCGCCT 
ACTGGGTCTC TCTGGTTAGA 
CCACTGCTTA AGCCTCAATA 
TTGTGTGACT CTGGTAACTA 
AGCAGCATGT GAG C AAAAGG 
GCGTTTTTCC ATAGGCTCCG 
AGGTGGCGAA ACCCGACAGG 
GTGCGCTCTC CTGTTCCGAC 
GGAAGCGTGG CGCTTTCTCA 
CGCTCCAAGC TGGGCTGTGT 
GGTAACTATC GTCTTGAGTC 
ACTGGTAACA GGATTAG CAG 
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5881 GTAGGCGGTG CTACAGAGTT 
TAGAAGAACA 

5 941 G T ATTTGGT A TCTGCGCTCT 
TGGTAGCTCT 
5 60 01 TGATCCGGCA AACAAACCAC 

GCAGCAGATT 

60 61 ACGCGCAGAA AAAAAGGATC 
GTCTGACGCT 

6121 CAGTGGAACG AAAACTCACG 
10 AAGGATCTTC 

6181 ACCTAGATCC TTTTAAATTA 
ATATGAGTAA 

6241 ACTTGGTCTG AC AGTTAC C A 
GATCTGTCTA 
15 63 01 TTTCGTTCAT CCATAGTTGC 

ACGGGAGGGC 

63 61 TTACCATCTG GCCCCAGTGC 
GGCTCCAGAT 

6421 TTATCAGCAA TAAAC C AG C C 
20 TGCAACTTTA 

64 81 TCCGCCTCCA TCCAGTCTAT 
TTCGCCAGTT 

6541 AATAGTTTGC GCAACGTTGT 
CTCGTCGTTT 
25 66 01 GGTATGGCTT CATTCAGCTC 

ATCCCCCATG 

6661 TTGTGCAAAA AAGCGGTTAG 
TAAGTTGGCC 

6721 GCAGTGTTAT CACTCATGGT 
30 CATGCCATCC 

67 81 GTAAGATGCT TTTCTGTGAC 
ATAGTGTATG 

6841 CGGCGACCGA GTTGCTCTTG 
ACATAGCAGA 
35 6901 ACTTTAAAAG TGCTCATCAT 

AAGGATCTTA 

6961 CCGCTGTTGA GATCCAGTTC 
TTCAGCATCT 

7 021 TTTACTTTCA CCAGCGTTTC 
40 CGCAAAAAAG 

70 81 GGAATAAGGG CGACACGGAA 
ATATTATTGA 

7141 AGCATTTATC AGGGTTATTG 
TTAGAAAAAT 
45 72 01 AAACAAATAG GGGTTC CGCG 

// 



CTTGAAGTGG TGGCCTAACT ACGGCTACAC 
GCTGAAGCCA GTTACCTTCG GAAAAAGAGT 
CGCTGGTAGC GGTGGTTTTT TTGTTTGCAA 
TCAAGAAGAT CCTTTGATCT TTTCTACGGG 
TTAAGGGATT TTGGTCATGA GATTATCAAA 
AAAATGAAGT TTTAAATCAA TCTAAAGTAT 
ATGCTTAATC AGTGAGGCAC CTATCTCAGC 
CTGACTCCCC GTCGTGTAGA TAACTACGAT 
TGCAATGATA CCGCGAGACC CACGCTCACC 
AGC CGGAAGG GCCGAGCGCA GAAGTGGTCC 
TAATTGTTGC CGGGAAGCTA GAGTAAGTAG 
TGCCATTGCT ACAGGCATCG TGGTGTCACG 
CGGTTCCCAA CGATCAAGGC GAGTTACATG 
CTCCTTCGGT CCTCCGATCG TTGTCAGAAG 
TATGGCAGCA CTGCATAATT CTCTTACTGT 
TGGTGAGTAC TCAACCAAGT CATTCTGAGA 
CCCGGCGTCA ATACGGGATA ATACCGCGCC 
TGGAAAACGT TCTTCGGGGC GAAAACTCTC 
GATGTAACCC ACTCGTGCAC CCAACTGATC 
TGGGTGAGCA AAAACAGGAA GGCAAAATGC 
ATGTTGAATA CTCATACTCT TCCTTTTTCA 
TCTCATGAGC GGATACATAT TTGAATGTAT 
CACATTTCCC CGAAAAGTGC CACCTGAC 



(SEQ ID NO: 5) 
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J 



[00349] pLL3.4 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



LOCUS 
2002 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES 

promoter 



PLENTILOX 



7 969 BP DS-DNA CIRCULAR SYN 



23-JAN- 



misc_recomb 
promoter 
gene 

rep__origin 
misc_recomb 
LTR 

mi s c_f e a t ur e 
misc_f eature 
misc_f eature 
LTR 
gene 

1988 A 



Location/Qualifiers 
212 . . 816 

/note= n CMV promoter/ enhancer 1" 
4769. .4802 
/note="LoxP» 
2632 . .3841 

/note= n UbC promoter" 
6976 . .7836 
/note="AmpR" 
6158 . . 6831 
/note="pUC" 
3931. .3966 
/note="LoxP" 
835 . .1509 

/note="5' HIV R-U5-del gag (HIV NL4-3/454-1126) " 
1539. .2396 

/note="HIV RRE (HIV NL4-3/7622-8459) » 
2422 . .2599 
/note="HIV Flap" 
4857 . .5446 
/note="WRE element" 
5466 . . 6155 
/note="3 1 SIN LTR" 
3993 . .4715 
/note="EGFP" 

1938 C 2150 G 1893 T 0 OTHER 



BASE COUNT 
ORIGIN 

1 GTCGACGGAT CGGGAGATCT CCCGATCCCC TATGGTGCAC TCTCAGTACA 
ATCTGCTCTG 

61 ATGCCGCATA GTTAAGC CAG TATCTGCTCC CTGCTTGTGT GTTGGAGGTC 
GCTGAGTAGT 

121 GCGCGAGCAA AATTTAAGCT ACAACAAGGC AAGGCTTGAC CGACAATTGC 
ATGAAGAATC 

181 TGCTTAGGGT TAGGCGTTTT GCGCTGCTTC GCGATGTACG GGCCAGATAT 
ACGCGTTGAC 

241 ATTGATTATT GACTAGTTAT TAATAGTAAT CAATTACGGG GTCATTAGTT 
CATAGC CC AT 

3 01 ATATGGAGTT C CGCGTTAC A TAACTTACGG TAAATGGCCC GCCTGGCTGA 
CCGCCCAACG 

3 61 ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT AGTAACGCCA 
ATAGGGACTT 

421 TCCATTGACG TCAATGGGTG GAGTATTTAC GGTAAACTGC CCACTTGGCA 
GTACATCAAG 

481 TGTATCATAT GCCAAGTACG CCCCCTATTG ACGTCAATGA CGGTAAATGG 
CCCGCCTGGC 

541 ATTATGCCCA GTACATGACC TTAT GGG ACT TTCCTACTTG GCAGTACATC 
TACGTATTAG 

601 TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT CAATGGGCGT 
GGATAGCGGT 
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661 TTGACTCACG 
TTGTTTTGGC 

721 ACCAAAATCA 
ACGCAAATGG 

781 GCGGTAGGCG 
TACTGGGTCT 

841 CTCTGGTTAG 
CCCACTGCTT 

901 AAGCCTCAAT 
GTTGTGTGAC 

961 TCTGGTAACT 
TAGCAGTGGC 

1021 GCCCGAACAG 
CGCAGGACTC 

1081 GGCTTGCTGA 
ACGCCAAAAA 

1141 TTTTGACTAG 
ATTAAGCGGG 

12 01 GGAGAATTAG 
AAAAAATATA 

12 61 AATTAAAACA 
AATCCTGGCC 

1321 TGTTAGAAAC 
TCCCTTCAGA 

13 81 CAGGATCAGA 
TGTGTGCATC 

1441 AAAGGATAGA 
GAGCAAAACA 

1501 AAAGTAAGAC 
TGGAGGAGGA 

1561 GATATGAGGG 
AATTGAACCA 

1621 TTAGGAGTAG 
AAGAGCAGTG 

1681 GGAATAGGAG 
GGGCGCAGCG 

1741 TCAATGACGC 
GCAGCAGAAC 

1801 AATTTGCTGA 
CTGGGGCATC 

18 61 AAGCAGCTCC 
ACAGCTCCTG 

1921 GGGATTTGGG 
GAATGCTAGT 

1981 TGGAGTAATA 
GTGGGACAGA 

2 041 GAAATTAACA 
AAACCAGCAA 

2101 GAAAAGAATG 
GAATTGGTTT 

2161 AACATAACAA 
AGGCTTGGTA 

2221 GGTTTAAGAA 
GGGATATTCA 

22 81 CCATTATCGT 
CGAAGGAATA 

2341 GAAGAAGAAG 
CGGATCGGCA 



GGGATTTCCA AGTCTCCACC 
ACGGGACTTT CCAAAATGTC 
TGTACGGTGG GAGGTCTATA 
ACCAGATCTG AGCCTGGGAG 
AAAGCTTGCC TTGAGTGCTT 
AGAGATCCCT CAGACCCTTT 
GGACTTGAAA GCGAAAGGGA 
AGCGCGCACG GCAAGAGGCG 
CGGAGGCTAG AAGGAGAGAG 
ATCGCGATGG GAAAAAATTC 
TATAGTATGG GCAAGCAGGG 
ATCAGAAGGC TGTAGACAAA 
AGAACTTAGA TCATTATATA 
GATAAAAGAC AC C AAGGAAG 
CACCGCACAG CAAGCGGCCG 
ACAATTGGAG AAGTGAATTA 
CACCCACCAA GGCAAAGAGA 
CTTTGTTCCT TGGGTTCTTG 
TGACGGTACA GGCCAGACAA 
GGGCTATTGA GGCGCAACAG 
AGGCAAGAAT CCTGGCTGTG 
GTTGCTCTGG AAAACT C ATT 
AATCTCTGGA ACAGATTTGG 
ATTACACAAG CTTAATACAC 
AACAAGAATT ATTGGAATTA 
ATTGGCTGTG GTATATAAAA 
TAGTTTTTGC TGTACTTTCT 
TTCAGACCCA CCTCCCAACC 
GTGGAGAGAG AGACAGAGAC 



CCATTGACGT CAATGGGAGT 
GTAACAACTC CGCCCCATTG 
TAAGCAGCGC GTTTTGCCTG 
CTCTCTGGCT AACTAGGGAA 
CAAGTAGTGT GTGCCCGTCT 
TAGTCAGTGT GGAAAATCTC 
AAC CAGAGGA GCTCTCTCGA 
AGGGGCGGCG AC TGGTG AGT 
ATGGGTGCGA GAGCGTCAGT 
GGTTAAGGCC AGGGGGAAAG 
AG CTAGAACG ATTCGCAGTT 
TACTGGGACA GCTACAACCA 
ATACAGTAGC AACCCTCTAT 
CTTTAGACAA GATAGAGGAA 
GCCGCGCTGA TCTTCAGACC 
TATAAATATA AAGTAGTAAA 
AGAGTGGTGC AGAGAGAAAA 
GGAGCAGCAG GAAGCAC TAT 
TTATTGTCTG GTATAGTGCA 
CATCTGTTGC AACTCACAGT 
G AAAG AT AC C TAAAGGATCA 
TGCACCACTG CTGTGCCTTG 
AATCACACGA CCTGGATGGA 
TCCTTAATTG AAGAATCGCA 
GATAAATGGG CAAGTTTGTG 
TTATTCATAA TGATAGTAGG 
ATAGTGAATA GAGTTAGGCA 
CCGAGGGGAC CCGACAGGCC 
AGATCCATTC GATTAGTGAA 



Page 129 of 171 



WO 2004/022722 



PCT/US2003/028111 



24 01 CTGCGTGCGC CAATTCTGCA 
AAAGAAAAGG 

24 61 GGGGATTGGG GGGTACAGTG 
CAGACATACA 

5 2 521 AACTAAAGAA TTACAAAAAC 

AT T AC AGGGA 

25 81 CAGCAGAGAT CCAGTTTGGT 
TGGCCTCCGC 

2 641 GCCGGGTTTT GGCGCCTCCC 
10 GCCACGTCAG 

2 701 ACGAAGGGCG CAGGAGCGTC 
CGGCCCGCTG 

2 7 61 CTCATAAGAC TCGGCCTTAG 
GACGGGACTT 
15 2 821 GGGTGACTCT AGGGCACTGG 

AAAAGTAGTC 

2 8 81 CCTTCTCGGC GATTCTGCGG 
GATTATATAA 

2 941 GGACGCGCCG GGTGTGGCAC 
20 CGCGGTTCTT 

3 0 01 GTTTGTGGAT CGCTGTGATC 
GGCCGGGGCT 

3 061 TTCGTGGCCG CCGGGCCGCT 
CAAGGGCTGT 
25 3121 AGTCTGGGTC CGCGAGCAAG 

CAGCAAAATG 

3181 GCGGCTGTTC CCGAGTCTTG 
GGTCGTTGAA 

3241 ACAAGGTGGG GGGCATGGTG 
30 CGCTAATGCG 

33 01 GGAAAGCTCT TATTCGGGTG 
GACGTGAAGT 

33 61 TTGTCACTGA CTGGAGAACT 
TATGCGGTGC 

35 3 421 CGTTGGGCAG TGCACCCGTA 

TGACGTCACC 

34 81 CGTTCTGTTG GCTTATAATG 
GGTAGGCTTT 

3 541 TCTCCGTCGC AGGACGCAGG 
40 GACAGGCGCC 

3 601 GGACCTCTGG TGAGGGGAGG 
TTTTATGTAC 

3 661 CTATCTTCTT AAGTAGCTGA 
TGGCGAGTGT 
45 3 721 GTTTTGTGAA GTTTTTTAGG 

TATGTAATTT 

3 781 TCAGTGTTAG ACTAGTAAAT 
TTTTGTTAGA 

3 841 CGAAGCTAAC GCGCTAGCCG 
50 TAAGCTCGCT 

3 901 TCACGAGATT CCAGCAGGTC 
TTATACGAAG 

3 961 TTATATTAAG GGTTCCAAGC 
GGCGAGGAGC 
55 4 021 TGTTCACCGG GGTGGTGCCC 

GGCCACAAGT 

40 81 TCAGCGTGTC CGGCGAGGGC 
CTGAAGTTCA 



GACAAATGGC AGTATTCATC CACAATTTTA 
CAGGGGAAAG AATAGTAGAC ATAATAGCAA 
AAATTACAAA AATTCAAAAT TTTCGGGTTT 
TAGTACCGGG CCCGCTCTAG ACGGTTGATC 
GCGGGCGCCC CCCTCCTCAC GGCGAGCGCT 
CTGATCCTTC CGCCCGGACG CTCAGGACAG 
AACCC CAGTA TCAGCAGAAG GACATTTTAG 
TTTTCTTTCC AGAGAGCGGA ACAGGCGAGG 
AGGGATCTCC GTGGGGCGGT GAACGCCGAT 
AGCTAGTTCC GTCGCAGCCG GGATTTGGGT 
GTCACTTGGT GAGTAGCGGG CTGCTGGGCT 
CGGTGGGACG GAAGCGTGTG GAGAGACCGC 
GTTGCCCTGA ACTGGGGGTT GGGGGGAGCG 
AATGGAAGAC GCTTGTGAGG CGGGCTGTGA 
GGCGGCAAGA ACCCAAGGTC TTGAGGCCTT 
AGATGGGCTG GGG C AC CAT C TGGGGACCCT 
CGGTTTGTCG TCTGTTGCGG GGGCGGCAGT 
CCTTTGGGAG CGCGCGCCCT CGTCGTGTCG 
CAGGGTGGGG CCACCTGCCG GTAGGTGTGC 
GTTCGGGCCT AGGGTAGGCT CTCCTGAATC 
GATAAGTGAG GCGTCAGTTT CTTTGGTCGG 
AGCTCCGGTT TTGAACTATG CGCTCGGGGT 
CACCTTTTGA AATGTAATCA TTTGGGTCAA 
TGTCCGCTAA ATTCTGGCCG TTTTTGGCTT 
TTAATTAAGC CTCGAGGTCG ACGGTATCGA 
GAGGGACCTA ATAACTTCGT ATAGCATACA 
TTAAGCGGCC GCGCCACCAT GGTGAGCAAG 
ATCCTGGTCG AGCTGGACGG CGACGTAAAC 
GAGGGCGATG CCACCTACGG CAAGCTGACC 
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4141 TCTGCACCAC CGGCAAGCTG 
CTGACCTACG 

42 01 GCGTGCAGTG CTTCAGCCGC 
TTCAAGTCCG 

42 61 CCATGCCCGA AGGCTACGTC 
GGCAACTACA 

4321 AGACCCGCGC CGAGGTGAAG 
GAGCTGAAGG 

43 81 GCATCGACTT CAAGGAGGAC 
AACTACAACA 

4441 GCCACAACGT CTATATCATG 
AACTTCAAGA 

45 01 TCCGCCACAA CATCGAGGAC 
CAGAACACCC 

45 61 CCATCGGCGA CGGCCCCGTG 
CAGTCCGCCC 

4 621 TGAGCAAAGA CCCCAACGAG 
GTGACCGCCG 

46 81 CCGGGATCAC TCTCGGCATG 
TGGCGCGCCA 

4741 TGGATCCGCG AATTCGTCGA 
ATACGAAGTT 

4 8 01 AT AC ATGT TT AAGGGTTCCG 
TATCGATAAT 

4 8 61 CAACCTCTGG ATTACAAAAT 
TGTTGCTCCT 

4 921 TTTACGCTAT GTGGATACGC 
TTCCCGTATG 

4981 GCTTTCATTT TCTCCTCCTT 
GGAGTTGTGG 

5041 CCCGTTGTCA GGCAACGTGG 
CCCCACTGGT 

5101 TGGGGCATTG CCACCACCTG 
CCTCCCTATT 

5161 GCCACGGCGG AACTCATCGC 
TCGGCTGTTG 

5221 GGCACTGACA ATTCCGTGGT 
GCTGCTCGCC 

52 81 TGTGTTGCCA CCTGGATTCT 
GGCCCTCAAT 

5341 CCAGCGGACC TTCCTTCCCG 
GCGTCTTCGC 

54 01 CTTCGCCCTC AGACGAGTCG 
ATACCGTCGA 

5461 CCTCGATCGA GACCTAGAAA 
CAGCTAC CAA 

5521 TGCTGATTGT GCCTGGCTAG 
CAGTCACACC 

55 81 TCAGGTACCT TTAAGAC CAA 
ACTTTTTAAA 

5641 AGAAAAGGGG GGACTGGAAG 
TCCTTGATCT 

57 01 GTGGATCTAC CACACACAAG 
CAGGGCCAGG 

5761 GATCAGATAT CCACTGACCT 
AGCAAGAGAA 

5 821 GGTAGAAGAA GCCAATGAAG 
GCCTGCATGG 



CCCGTGCCCT GGCCCACCCT CGTGACCACC 
TACCCCGACC ACATGAAGCA GCACGACTTC 
CAGGAGCGCA CCATCTTCTT CAAGGACGAC 
TTCGAGGGCG ACACCCTGGT GAACCGCATC 
GGCAACATCC TGGGGCACAA GCTGGAGTAC 
GCCGACAAGC AGAAGAACGG CATCAAGGTG 
GGCAGCGTGC AGCTCGCCGA CCACTACCAG 
CTGCTGCCCG ACAACCACTA CCTGAGCACC 
AAGCGCGATC ACATGGTCCT GCTGGAGTTC 
GACGAGCTGT ACAAGATGCA TGCCCCGGGA 
GGGACCTAAT AACTTCGTAT AGCATACATT 
GTTCCACTAG GTACAATTCG AT AT CAAGCT 
TTGTGAAAGA TTGACTGGTA TTCTTAACTA 
TGCTTTAATG CCTTTGTATC ATGCTATTGC 
GTATAAATCC TGGTTGCTGT CTCTTTATGA 
CGTGGTGTGC ACTGTGTTTG CTGACGCAAC 
TCAGCTCCTT TCCGGGACTT TCGCTTTCCC 
CGCCTGCCTT GCCCGCTGCT GGACAGGGGC 
GTTGTCGGGG AAATCATCGT CCTTTCCTTG 
GCGCGGGACG TCCTTCTGCT ACGTCCCTTC 
CGGCCTGCTG CCGGCTCTGC GGCCTCTTCC 
GATCTCCCTT TGGGCCGCCT CCCCGCATCG 
AACATGGAGC AATCACAAGT AGCAATACAG 
AAGCACAAGA GGAGGAGGAG GTGGGTTTTC 
TGACTTACAA GGCAGCTGTA GATCTTAGCC 
GGCTAATTCA CTCCCAACGA AGACAAGATA 
GCTACTTCCC TGATTGGCAG AACTACACAC 
TTGGATGGTG CTACAAGCTA GTACCAGTTG 
GAGAGAACAC CCGCTTGTTA CACCCTGTGA 
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5 8 81 GATGGATGAC CCGGAGAGAG 
TAGCATTTCA 

5 941 TCACATGGCC CGAGAGCTGC 
AC C AGATCTG 

5 6001 AGCCTGGGAG CTCTCTGGCT 

AAAGCTTGCC 

6 061 TTGAGTGCTT CAAGTAGTGT 
AGAGATCCCT 

6121 CAGACCCTTT TAGTCAGTGT 
10 GCCAGCAAAA 

6181 GGCCAGGAAC CGTAAAAAGG 
GCCCCCCTGA 

6241 CGAGCATCAC AAAAAT CGAC 
GACTATAAAG 
15 63 01 ATACCAGGCG TTTCCCCCTG 

CCCTGCCGCT 

63 61 TAG CGGATAC CTGTCCGCCT 
ATAGCTCACG 

6421 CTGTAGGTAT CTCAGTTCGG 
20 TGCACGAACC 

64 81 CCCCGTTCAG CCCGACCGCT 
CCAACCCGGT 

6541 AAGACACGAC TTATCGCCAC 
GAGCGAGGTA 
25 6601 TGTAGGCGGT GCTACAGAGT 

CTAGAAGAAC 

6661 AGTATTTGGT ATCTGCGCTC 
TTGGTAGCTC 

6721 TTGATCCGGC AAACAAACCA 
30 AGCAGCAGAT 

6781 TACGCGCAGA AAAAAAGGAT 
GGTCTGACGC 

6841 TCAGTGGAAC GAAAACTCAC 
AAAGGATCTT 
35 69 01 CACCTAGATC CTTTTAAATT 

TATATGAGTA 

6961 AACTTGGTCT GACAGTTACC 
CGATCTGTCT 

7 021 ATTTCGTTCA TCCATAGTTG 
40 TACGGGAGGG 

70 81 CTTACCATCT GGCCCCAGTG 
CGGCTCCAGA 

7141 TTTATCAGCA ATAAAC C AG C 
CTGCAACTTT 
45 72 01 ATCCGCCTCC ATCCAGTCTA 

GTTCGCCAGT 

72 61 TAATAGTTTG CGCAACGTTG 
GCTCGTCGTT 

7321 TGGTATGGCT TCATTCAGCT 
50 GATCCCCCAT 

73 81 GTTGTGCAAA AAAG CGGTT A 
GTAAGTTGGC 

7441 CGCAGTGTTA TCACTCATGG 
TCATGCCATC 
55 75 01 CGTAAGATGC TTTTCTGTGA 

AATAGTGTAT 

75 61 GCGGCGACCG AGTTGCTCTT 
CACATAGCAG 



AAGTATTAGA GTGGAGGTTT GACAGCCGCC 
ATCCGGACTG TACTGGGTCT CTCTGGTTAG 
AACTAGGGAA CCCACTGCTT AAGCCTCAAT 
GTGCCCGTCT GTTGTGTGAC TCTGGTAACT 
GGAAAATCTC TAGCAGCATG TGAGCAAAAG 
CCGCGTTGCT GGCGTTTTTC CATAGGCTCC 
GCTCAAGTCA GAGGTGGCGA AACCCGACAG 
GAAGCTCCCT CGTGCGCTCT CCTGTTCCGA 
TTCTCCCTTC GGGAAGCGTG GCGCTTTCTC 
TGTAGGTCGT TCGCTCCAAG CTGGGCTGTG 
GCGCCTTATC CGGTAACTAT CGTCTTGAGT 
TGGCAGCAGC CACTGGTAAC AGGATT AG CA 
TCTTGAAGTG GTGGCCTAAC TACGGCTACA 
TGCTGAAGCC AGTTACCTTC GGAAAAAGAG 
CCGCTGGTAG CGGTGGTTTT TTTGTTTGCA 
CTCAAGAAGA TCCTTTGATC TTTTCTACGG 
GTTAAGGGAT TTTGGTCATG AGATTATCAA 
AAAAATGAAG TTTTAAATCA ATCTAAAGTA 
AATGCTTAAT CAGTGAGGCA CCTATCTCAG 
CCTGACTCCC CGTCGTGTAG ATAACTACGA 
CTGCAATGAT ACCGCGAGAC CCACGCTCAC 
CAGCCGGAAG GGCCGAGCGC AGAAGTGGTC 
TTAATTGTTG CCGGGAAGCT AGAGTAAGTA 
TTGCCATTGC TACAGG CATC GTGGTGTCAC 
CCGGTTCCCA ACGATCAAGG CGAGTTACAT 
GCTCCTTCGG TCCTCCGATC GTTGTCAGAA 
TTATGGCAGC ACTGCATAAT TCTCTTACTG 
CTGGTGAGTA CTCAACCAAG TCATTCTGAG 
GCCCGGCGTC AATAC GGG AT AATACCGCGC 
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7621 AACTTTAAAA GTGCTCATCA TTGGAAAACG TTCTTCGGGG CGAAAACTCT 
CAAGGATCTT 

7 681 ACCGCTGTTG AGATCCAGTT CGATGTAACC CACTCGTGCA CCCAACTGAT 
CTTCAGCATC 

5 7741 TTTTACTTTC ACCAGCGTTT CTGGGTGAGC AAAAACAGGA AGGCAAAATG 

CCGCAAAAAA 

7 801 GGGAATAAGG GCGACACGGA AATGTTGAAT ACTCATACTC TTCCTTTTTC 
AATATTATTG 

7 861 AAGCATTTAT CAGGGTTATT GTCTCATGAG CGGATACATA TTTGAATGTA 
10 TTTAGAAAAA 

7 921 TAAACAAATA GGGGTTCCGC GCACATTTCC CCGAAAAGTG CCACCTGAC 

// 



(SEQ ID NO: 6) 

15 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



[00350] pLL3.5 

LOCUS PLL3 . 5 . GB 

2002 

DEFINITION - 
ACCESSION 
KEYWORDS 
SOURCE 
FEATURES 

promoter 



7927 BP DS-DNA CIRCULAR SYN 



23- JAN - 



mi s c_recomb 

promoter 

gene 

rep__origin 

misc_recotnb 

LTR 

misc_feature 
misc__f eature 
misc_f eature 
LTR 
gene 



Location/Qualifiers 
212 . . 816 
/note="CMV promoter/ enhancer 1" 
4727 . .4760 
/note="LoxP " 
2632 . .3841 
/note="UbC promoter" 
6934 . . 7794 
/note="AmpR" 
6116. . 6789 
/note="pUC " 
3931. .3966 
/note="LoxP" 
835. . 1509 

/note="5' HIV R-U5-del gag (HIV NL4 -3/454-1126) 
1539. .2396 

/note=»'HIV RRE (HIV NL4 -3/7622-8459) " 

2422 . .2599 

/note="HIV Flap" 

4815 . .5404 

/note="WRE element" 

5424 . . 6113 

/note="3 » SIN LTR" 

3993 . .4673 

/note="dsRed2 " 

1925 C 2151 G 



1893 T 



0 OTHER 



BASE COUNT 195 8 A 

ORIGIN 

1 GTCGACGGAT CGGGAGATCT CCCGATCCCC TATGGTGCAC TCTCAGTACA 
ATCTGCTCTG 

61 ATGCCGCATA GTTAAGC CAG TATCTGCTCC CTGCTTGTGT GTTGGAGGTC 
GCTGAGTAGT 

121 GCGCGAGCAA AATTTAAGCT ACAACAAGGC AAGGCTTGAC CGACAATTGC 
ATGAAGAATC 

181 TGCTTAGGGT TAGGCGTTTT GCGCTGCTTC GCGATGTACG GG C CAG AT AT 
ACGCGTTGAC 

241 ATTGATTATT GACTAGTTAT TAATAGTAAT CAATTACGGG GTCATTAGTT 
CATAGC CCAT 

3 01 ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA 
CCGCCCAACG 

3 61 ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT AGTAACGCCA 
ATAGGGACTT 

421 TCCATTGACG TCAATGGGTG GAGTATTTAC GGTAAACTGC CCACTTGGCA 
GTACATCAAG 

4 81 TGTATCATAT GCCAAGTACG CCCCCTATTG ACGTCAATGA CGGTAAATGG 
CCCGCCTGGC 

541 ATTATGCCCA GTACATGACC TTATGGGACT TTCCTACTTG GCAGTACATC 
TACGTATTAG 

601 TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT CAATGGGCGT 
GGATAGCGGT 
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661 TTGACTCACG GGGATTTCCA 
TTGTTTTGGC 

721 AC CAAAATC A ACGGGACTTT 
ACGCAAATGG 

7 81 GCGGTAGGCG TGTACGGTGG 
TACTGGGTCT 

841 CTCTGGTTAG ACCAGATCTG 
CCCACTGCTT 

901 AAGCCTCAAT AAAGCTTGCC 
GTTGTGTGAC 

961 TCTGGTAACT AGAGATCCCT 
TAGCAGTGGC 

1021 GCCCGAACAG GGACTTGAAA 
CGCAGGACTC 

1081 GGCTTGCTGA AGCGCGCACG 
ACGCCAAAAA 

1141 TTTTGACTAG CGGAGGCTAG 
ATTAAGCGGG 

12 01 GGAGAATTAG ATCGCGATGG 
AAAAAATATA 

12 61 AATTAAAACA TATAGTATGG 
AATCCTGGCC 

13 21 TGTTAGAAAC ATCAGAAGGC 
TCCCTTCAGA 

13 81 CAGGATCAGA AGAACTTAGA 
TGTGTGCATC 

1441 AAAGGATAGA GATAAAAGAC 
GAGCAAAACA 

15 01 AAAGTAAGAC CACCGCACAG 
TGGAGGAGGA 

15 61 GATATGAGGG ACAATTGGAG 
AATTGAACCA 

1621 TTAGGAGTAG CACCCACCAA 
AAGAGCAGTG 

1681 GGAATAGGAG CTTTGTTCCT 
GGGCGCAGCG 

1741 TCAATGACGC TGACGGTACA 
GCAGCAGAAC 

18 01 AATTTGCTGA GGGCTATTGA 
CTGGGGCATC 

18 61 AAGCAGCTCC AGGCAAGAAT 
ACAGCTCCTG 

1921 GGGATTTGGG GTTGCTCTGG 
GAATGCTAGT 

1981 TGGAGTAATA AATCTCTGGA 
GTGGGACAGA 

2 041 GAAATTAACA ATTACACAAG 
AAACCAGCAA 

2101 GAAAAGAATG AACAAGAATT 
GAATTGGTTT 

2161 AACATAACAA ATTGGCTGTG 
AGGCTTGGTA 

2 221 GGTTTAAGAA TAGTTTTTGC 
GGGATATTCA 

22 81 CCATTATCGT TTCAGACCCA 
CGAAGGAATA 

2341 GAAGAAGAAG GTGGAGAGAG 
CGGATCGGCA 



AGTCTCCACC CCATTGACGT CAATGGGAGT 
CCAAAATGTC GTAACAACTC CGCCCCATTG 
GAGGTC TATA TAAGCAGCGC GTTTTGCCTG 
AGCCTGGGAG CTCTCTGGCT AACTAGGGAA 
TTGAGTGCTT CAAGTAGTGT GTGCCCGTCT 
CAGACCCTTT TAGTCAGTGT GGAAAATCTC 
GCGAAAGGGA AACCAGAGGA GCTCTCTCGA 
GCAAGAGGCG AGGGGCGGCG ACTGGTGAGT 
AAGGAGAGAG ATGGGTGCGA GAGCGTCAGT 
GAAAAAATTC GGTTAAGGCC AG GGGGAAAG 
GCAAGCAGGG AGCTAGAACG ATTCGCAGTT 
TGTAGACAAA TACTGGGACA GCTACAACCA 
TCATTATATA ATACAGTAGC AACCCTCTAT 
AC C AAGGAAG CTTTAGACAA GATAGAGGAA 
CAAGCGGCCG GCCGCGCTGA TCTTCAGACC 
AAGTGAATTA TATAAATATA AAGTAGTAAA 
GGCAAAGAGA AGAGTGGTGC AGAGAGAAAA 
TGGGTTCTTG GGAGCAGCAG GAAGCACTAT 
GGCCAGACAA TTATTGTCTG GTATAGTGCA 
GGCGCAACAG CATCTGTTGC AACTCACAGT 
CCTGGCTGTG GAAAGATACC TAAAGGATCA 
AAAACTCATT TGCACCACTG CTGTGCCTTG 
ACAGATTTGG AATCACACGA CCTGGATGGA 
CTTAATACAC TCCTTAATTG AAGAATCGCA 
ATTGGAATTA GATAAATGGG CAAGTTTGTG 
GTATATAAAA TTATTCATAA TGATAGTAGG 
TGTACTTTCT ATAGTGAATA GAGTTAGGCA 
CCTCCCAACC CCGAGGGGAC CCGACAGGCC 
AGACAGAGAC AGATCCATTC GATTAGTGAA 
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2401 CTGCGTGCGC CAATTCTGCA 
AAAGAAAAGG 

2461 GGGGATTGGG GGGTACAGTG 
CAGACATACA 
5 2 521 AACTAAAGAA TTACAAAAAC 

ATTACAGGGA 

25 81 CAGCAGAGAT CCAGTTTGGT 
TGGCCTCCGC 

2 641 GCCGGGTTTT GGCGCCTCCC 
10 GCCACGTCAG 

2701 ACGAAGGGCG CAGGAGCGTC 
CGGCCCGCTG 

2 761 CTCATAAGAC TCGGCCTTAG 
GACGGGACTT 
15 2 821 GGGTGACTCT AGGGCACTGG 

AAAAGTAGTC 

2 8 81 CCTTCTCGGC GATTCTGCGG 
GATTATATAA 

2 941 GGACGCGCCG GGTGTGGCAC 
20 CGCGGTTCTT 

3 001 GTTTGTGGAT CGCTGTGATC 
GGCCGGGGCT 

3 0 61 TTCGTGGCCG CCGGGCCGCT 
CAAGGGCTGT 
25 3121 AGTCTGGGTC CGCGAGCAAG 

CAGCAAAATG 

3181 GCGGCTGTTC CCGAGTCTTG 
GGTCGTTGAA 

3241 ACAAGGTGGG GGGCATGGTG 
30 CGCTAATGCG 

33 01 GGAAAGCTCT TATTCGGGTG 
GACGTGAAGT 

33 61 TTGTCACTGA CTGGAGAACT 
TATGCGGTGC 

35 3421 CGTTGGGCAG TGCACCCGTA 

TGACGTCACC 

34 81 CGTTCTGTTG GCTTATAATG 
GGTAGGCTTT 

3541 TCTCCGTCGC AGGACGCAGG 
40 GACAGGCGCC 

3 601 GGACCTCTGG TGAGGGGAGG 
TTTTATGTAC 

3 661 CTATCTTCTT AAGTAGCTGA 
TGGCGAGTGT 
45 3 721 GTTTTGTGAA GTTTTTTAGG 

TATGTAATTT 

3781 TCAGTGTTAG ACTAGTAAAT 
TTTTGTTAGA 

3 841 CGAAGCTAAC GCGCTAGCCG 
50 TAAGCTCGCT 

3 901 TCACGAGATT CCAGCAGGTC 
TTATACGAAG 

3 961 TTATATTAAG GGTTCCAAGC 
GAGAACGTCA 

55 4 021 TCACCGAGTT CATGCGCTTC 

CACGAGTTCG 

4 081 AGATCGAGGG CGAGGGCGAG 
AAGCTGAAGG 



GACAAATGGC AGTATTCATC CACAATTTTA 
CAGGGGAAAG AATAGTAGAC ATAATAGCAA 
AAATTACAAA AATTCAAAAT TTTCGGGTTT 
TAGT AC CGGG CCCGCTCTAG ACGGTTGATC 
GCGGGCGCCC CCCTCCTCAC GGCGAGCGCT 
CTGATCCTTC CGCCCGGACG CTCAGGACAG 
AACCCCAGTA TCAGCAGAAG GACATTTTAG 
TTTTCTTTCC AGAGAGCGGA ACAGGCGAGG 
AGGGATCTCC GTGGGGCGGT GAACGCCGAT 
AGCTAGTTCC GTCGCAGCCG GGATTTGGGT 
GTCACTTGGT GAGTAGCGGG CTGCTGGGCT 
CGGTGGGACG GAAGCGTGTG GAGAGAC CGC 
GTTGCCCTGA ACTGGGGGTT GGGGGGAGCG 
AATGGAAGAC GCTTGTGAGG CGGGCTGTGA 
GGCGGCAAGA ACCCAAGGTC TTGAGGCCTT 
AGATGGGCTG GGGCACCATC TGGGGACCCT 
CGGTTTGTCG TCTGTTGCGG GGGCGGCAGT 
CCTTTGGGAG CGCGCGCCCT CGTCGTGTCG 
CAGGGTGGGG CCACCTGCCG GTAGGTGTGC 
GTTCGGGCCT AGGGTAGGCT CTCCTGAATC 
GATAAGTGAG GCGTCAGTTT CTTTGGTCGG 
AGCTCCGGTT TTGAACTATG CGCTCGGGGT 
CACCTTTTGA AATGTAATCA TTTGGGTCAA 
TGTCCGCTAA ATTCTGGCCG TTTTTGGCTT 
TTAATTAAGC CTCGAGGTCG ACGGTATCGA 
GAGGGACCTA ATAACTTCGT ATAGCATACA 
TTAAGCGGCC GCGCCACCAT GGCCTCCTCC 
AAGGTGCGCA TGGAGGGCAC CGTGAACGGC 
GGCCGCCCCT ACGAGGGCCA CAAC AC CGTG 
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4141 TGACCAAGGG CGGCCCCCTG 
TTCCAGTACG 

42 01 GCTCCAAGGT GTACGTGAAG 
CTGTCCTTCC 

5 42 61 CCGAGGGCTT CAAGTGGGAG 

GCGACCGTGA 

4321 CCCAGGACTC CTCCCTGCAG 
ATCGGCGTGA 

43 81 ACTTCCCCTC CGACGGCCCC 
10 GCCTCCACCG 

4441 AGCGCCTGTA CCCCCGCGAC 
CTGAAGCTGA 

45 01 AGGACGGCGG CCACTACCTG 
AAGCCCGTGC 
15 4561 AGCTGCCCGG CTACTACTAC 

AACGAGGACT 

4 621 ACACCATCGT GGAGCAGTAC 
CTGATGCATG 

4 681 CCCCGGGATG GCGCGCCATG 
20 CTTCGTATAG 

4 741 CATACATTAT ACGAAGTTAT 
ACAATTCGAT 

4801 ATCAAGCTTA TCGATAATCA 
GACTGGTATT 
25 4 861 CTTAACTATG TTGCTCCTTT 

TTTGTATCAT 

4 921 GCTATTGCTT CCCGTATGGC 
GTTGCTGTCT 

4 981 CTTTATGAGG AGTTGTGGCC 
30 TGTGTTTGCT 

5 041 GACGCAACCC CCACTGGTTG 
CGGGACTTTC 

5101 GCTTTCCCCC TCCCTATTGC 
CCGCTGCTGG 
35 5161 ACAGGGGCTC GGCTGTTGGG 

ATCATCGTCC 

5221 TTTCCTTGGC TGCTCGCCTG 
CTTCTGCTAC 

52 81 GTCCCTTCGG CCCTCAATCC 
40 GGCTCTGCGG 

5341 CCTCTTCCGC GTCTTCGCCT 
GGCCGCCTCC 

54 01 CCGCATCGAT ACCGTCGACC 
TCACAAGTAG 
45 54 61 CAATACAGCA GCTACCAATG 

AGGAGGAGGT 

5521 GGGTTTTCCA GTCACACCTC 
CAGCTGTAGA 

5581 TCTTAGCCAC TTTTTAAAAG 
50 CCCAACGAAG 

5641 ACAAGATATC CTTGATCTGT 
ATTGGCAGAA 

5701 CTACACACCA GGGCCAGGGA 
ACAAGCTAGT 
55 5 761 AC C AGTTGAG CAAGAGAAGG 

GCTTGTTACA 

5821 CCCTGTGAGC CTGCATGGGA 
GGAGGTTTGA 



CCCTTCGCCT GGGACATCCT GTCCCCCCAG 
CACCCCGCCG ACATCCCCGA CTACAAGAAG 
CGCGTGATGA ACTTCGAGGA CGGCGGCGTG 
GACGGCTGCT TCATCTACAA GGTGAAGTTC 
GTGATGCAGA AGAAGACCAT GGGCTGGGAG 
GGCGTGCTGA AGGG CGAGAC CCACAAGGCC 
GTGGAGTTCA AGTC CAT CTA CATGGCCAAG 
GTGGACG C C A AGCTGGACAT CACCTCCCAC 
GAGCGCACCG AGGGCCGCCA CCACCTGTTC 
GATCCGCGAA TTCGTCGAGG G AC CTAATAA 
ACATGTTTAA GGGTTCCGGT TCCACTAGGT 
ACCTCTGGAT TACAAAATTT GTGAAAGATT 
TACGCTATGT GGATACGCTG CTTTAATGCC 
TTTCATTTTC TCCTCCTTGT ATAAATCCTG 
CGTTGTCAGG CAACGTGGCG TGGTGTGCAC 
GGGCATTGCC ACCACCTGTC AGCTCCTTTC 
CACGGCGGAA CTCATCGCCG CCTGCCTTGC 
CACTGACAAT TCCGTGGTGT TGTCGGGGAA 
TGTTGCCACC TGGATTCTGC GCGGGACGTC 
AGCGGACCTT CCTTCCCGCG GCCTGCTGCC 
TCGCCCTCAG ACGAGTCGGA TCTCCCTTTG 
TCGATCGAGA CCTAGAAAAA CATGGAGCAA 
CTGATTGTGC CTGGCTAGAA GCACAAGAGG 
AGGTACCTTT AAGACCAATG ACTTACAAGG 
AAAAGGGGGG ACTGGAAGGG CTAATTCACT 
GGATCTACCA CACACAAGGC TACTTCCCTG 
T C AG AT AT C C ACTGACCTTT GGATGGTGCT 
TAGAAGAAGC CAATGAAGGA G AG AAC AC C C 
TGGATGACCC GGAGAGAGAA GTATTAGAGT 
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58 81 CAGCCGCCTA GCATTTCATC 
CTGGGTCTCT 

5941 CTGGTTAGAC CAGATCTGAG 
CACTGCTTAA 
5 60 01 GCCTCAATAA AGCTTGCCTT 

TGTGTGACTC 

60 61 TGGTAACTAG AGATCCCTCA 
GCAGCATGTG 

6121 AGCAAAAGGC CAGCAAAAGG 
10 CGTTTTTCCA 

6181 TAGGCTCCGC CCCCCTGACG 
GGTGGC GAAA 

6241 CCCGACAGGA CTATAAAGAT 
TGCGCTCTCC 
15 63 01 TGTTC CGACC CTGCCGCTTA 

GAAGCGTGGC 

63 61 GCTTTCTCAT AGCTCACGCT 
GCTCCAAGCT 

6421 GGGCTGTGTG CACGAACCCC 
20 GTAACTATCG 

64 81 TCTTGAGTCC AACCCGGTAA 
CTGGTAACAG 

6541 GATTAGCAGA GCGAGGTATG 
GGCCTAACTA 
25 6601 CGGCTACACT AGAAGAACAG 

TTACCTTCGG 

6661 AAAAAGAGTT GGTAGCTCTT 
GTGGTTTTTT 

6721 TGTTTGCAAG CAGCAGATTA 
30 CTTTGATCTT 

67 81 TTCTACGGGG TCTGACGCTC 
TGGTCATGAG 

6 841 ATTATCAAAA AGGATCTTCA 
TTAAATCAAT 

35 6901 G TAAAGT AT A TATGAGTAAA 

GTGAGGC AC C 

6961 TATCTCAGCG ATCTGTCTAT 
TCGTGTAGAT 

7 021 AACTACGATA CGGGAGGGCT 
40 CGCGAGACCC 

7 081 ACGCTCACCG GCTCGAGATT 
CCGAGCGCAG 

7141 AAGTGGTCCT GCAACTTTAT 
GGGAAGCTAG 
45 72 01 AGTAAGTAGT TCGCCAGTTA 

CAGGCATCGT 

72 61 GGTGTCACGC TCGTCGTTTG 
GATCAAGGCG 

7321 AGTTACATGA TCCCCCATGT 
50 CTCCGATCGT 

73 81 TGTCAGAAGT AAGTTGGCCG 
TGCATAATTC 

7441 TCTTACTGTC ATGCCATCCG 
CAACCAAGTC 
55 7501 ATT CTGAGAA TAGTGTATGC 

TACGGGATAA 

7561 TACCGCGCCA CATAGCAGAA 
CTTCGGGGCG 



ACATGGCCCG AGAGCTGCAT CCGGACTGTA 
CCTGGGAGCT CTCTGGCTAA CTAGGGAACC 
GAGTGCTTCA AGTAGTGTGT GCCCGTCTGT 
GACCCTTTTA GTCAGTGTGG AAAATCTCTA 
CCAGGAACCG TAAAAAGGCC GCGTTGCTGG 
AGCATCACAA AAATCGACGC TCAAGTCAGA 
ACCAGGCGTT TCCCCCTGGA AGCTCCCTCG 
CCGGATACCT GTCCGCCTTT CTCCCTTCGG 
GTAGGTATCT CAGTTCGGTG TAGGTCGTTC 
CCGTTCAGCC CGACCGCTGC GCCTTATCCG 
GACACGACTT ATCGCCACTG GCAGCAGCCA 
TAGGCGGTGC T AC AGAGTT C TTGAAGTGGT 
TATTTGGTAT CTGCGCTCTG CTGAAGCCAG 
GATCCGGCAA ACAAACCACC GCTGGTAGCG 
CGCGCAGAAA AAAAGGATCT CAAGAAGATC 
AGTGGAACGA AAACTCACGT TAAGGGATTT 
CCTAGATCCT TTTAAATTAA AAATGAAGTT 
CTTGGTCTGA CAGTTACCAA TGCTTAATCA 
TTCGTTCATC CATAGTTGCC TGACTCCCCG 
T AC CATCTGG CCCCAGTGCT GCAATGATAC 
TATCAGCAAT AAACCAGCCA GCCGGAAGGG 
CCGCCTCCAT CCAGTCTATT AATTGTTGCC 
ATAGTTTGCG CAACGTTGTT GCCATTGCTA 
GTATGGCTTC ATTCAGCTCC GGTTCCCAAC 
TGTG CAAAAA AGCGGTTAGC TCCTTCGGTC 
CAGTGTTATC ACTCATGGTT ATGGCAGCAC 
TAAGATGCTT TTCTGTGACT GGTGAGTACT 
GGCGACCGAG TTGCTCTTGC CCGGCGTCAA 
CTTTAAAAGT GCTCATCATT GGAAAACGTT 
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7621 AAAACTCTCA AGGATCTTAC CGCTGTTGAG ATCCAGTTCG ATGTAACCCA 
CTCGTGCACC 

7 681 CAACTGATCT TCAGCATCTT TTACTTTCAC CAGCGTTTCT GGGTGAGCAA 
AAACAGGAAG 

5 7741 GCAAAATGCC GCAAAAAAGG GAATAAGGGC GACACGGAAA TGTTGAATAC 

TCATACTCTT 

7 8 01 CCTTTTTCAA TATTATTGAA GCATTTATCA GGGTTATTGT CTCATGAGCG 
GATACATATT 

7861 TGAATGTATT TAGAAAAATA AACAAATAGG GGTTCCGCGC ACATTTCCCC 
10 GAAAAGTGCC 

7 921 ACCTGAC 

// 

(SEQ ID NO: 7) 

15 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



[00351] pLL3.6 

LOCUS PLENTILOX 
2002 

DEFINITION - 
ACCESSION 
KEYWORDS 
SOURCE 
FEATURES 

promoter 



7350 BP DS-DNA 



CIRCULAR SYN 



23- JAN - 



promoter 
gene 

rep_origin 
misc_recomb 
misc__recomb 
LTR 

misc__f eature 
misc__f eature 
mi s cofeature 
LTR 
frag 
frag 
frag 
gene 

1939 A 



Location/ Qualifiers 
212 . . 816 
/note="CMV promoter/enhancer 1" 
2799. .3387 
/note="CMV" 
6357. .7217 
/note="AmpR" 
5539. . 6212 
/note="pUC 11 
2710 . .2745 
/note="Lox 1" 
4150 . .4183 
/note= ,, LoxP" 
835. .1509 

/note^'S' HIV R-U5-del gag (HIV NL4 -3/454-1126) " 
1539. .2396 

/note="HIV RRE (HIV NL4-3/7622-8459) » 
2422 . .2599 
/note="HIV Flap" 
4238 . .4827 
/note="WRE element" 
4847 . . 5536 
/note="3 ■ SIN LTR" 
2772 . .4130 

/note="l to 1359 of Untitledl" 
2772. .2798 

/note="4705 to 4731 of pEGFP-Cl" 
2799. .4127 

/note="l to 1329 of pEGFP-Cl" 
3404 . .4127 
/note="EGFP" 

1795 C 1862 G 1754 T 0 OTHER 



BASE COUNT 
ORIGIN 

1 GTCGACGGAT CGGGAGATCT CCCGATCCCC TATGGTGCAC TCTCAGTACA 
ATCTGCTCTG 

61 ATGCCGCATA GTTAAGC CAG TATCTGCTCC CTGCTTGTGT GTTGGAGGTC 
GCTGAGTAGT 

121 GCGCGAGCAA AATTTAAGCT ACAACAAGGC AAGGCTTGAC CGACAATTGC 
ATGAAGAATC 

181 TGCTTAGGGT TAGGCGTTTT GCGCTGCTTC GCGATGTACG GGC CAGATAT 
ACGCGTTGAC 

241 ATTGATTATT GACTAGTTAT TAATAGTAAT CAATTACGGG GTCATTAGTT 
CATAGCCCAT 

3 01 ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA 
CCGCCCAACG 

3 61 ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT AGTAACGCCA 
ATAGGGACTT 

421 TCCATTGACG TCAATGGGTG GAGTATTTAC GGTAAACTGC CCACTTGGCA 
GTACATCAAG 
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481 TGTATCATAT GCCAAGTACG 
CCCGCCTGGC 

541 ATTATGCCCA GTACATGACC 
TACGTATTAG 
5 601 TCATCGCTAT TACCATGGTG 

GGATAGCGGT 

661 TTGACTCACG GGGATTTCCA 
TTGTTTTGGC 

721 ACCAAAATCA ACGGGACTTT 
10 ACGCAAATGG 

781 GCGGTAGGCG TGTACGGTGG 
TACTGGGTCT 

841 CTCTGGTTAG ACCAGATCTG 
CCCACTGCTT 
15 901 AAGCCTCAAT AAAGCTTGCC 

GTTGTGTGAC 

961 TCTGGTAACT AGAGATCCGT 
TAGCAGTGGC 

1021 GCCCGAACAG GGACTTGAAA 
20 CGCAGGACTC 

10 81 GGCTTGCTGA AGCGCGCACG 
ACGCCAAAAA 

1141 TTTTGACTAG CGGAGGCTAG 
ATTAAGCGGG 
25 12 01 GGAGAATTAG ATCGCGATGG 

AAAAAATATA 

12 61 AATTAAAACA TATAGTATGG 
AATCCTGGCC 

1321 TGTTAGAAAC AT C AGAAGGC 
30 TCCCTTCAGA 

13 81 CAGGATCAGA AGAACTTAGA 
TGTGTGCATC 

1441 AAAGGATAGA GATAAAAGAC 
GAGCAAAACA 
35 15 01 AAAGTAAGAC CACCGCACAG 

TGGAGGAGGA 

15 61 GATATGAGGG ACAATTGGAG 
AATTGAAC C A 

1621 TTAGGAGTAG CACCCACCAA 
40 AAGAGCAGTG 

1681 GGAATAGGAG CTTTGTTCCT 
GGGCGCAGCG 

1741 TCAATGACGC TGACGGTACA 
GCAGCAGAAC 
45 1801 AATTTGCTGA GGGCTATTGA 

CTGGGGCATC 

18 61 AAGCAGCTCC AGGCAAGAAT 
ACAGCTCCTG 

1921 GGGATTTGGG GTTGCTCTGG 
50 GAATGCTAGT 

1981 TGGAGTAATA AATCTCTGGA 
GTGGGACAGA 

2 041 GAAATTAACA ATTACACAAG 
AAACCAGCAA 
55 2101 GAAAAGAATG AACAAGAATT 

GAATTGGTTT 

2161 AACATAACAA ATTGGCTGTG 
AGGCTTGGTA 



CCCCCTATTG ACGTCAATGA CGGTAAATGG 
TTATGGGACT TTCCTACTTG GCAGTACATC 
ATGCGGTTTT GGCAGTACAT CAATGGGCGT 
AGTCTCCACC CCATTGACGT CAATGGGAGT 
CCAAAATGTC GTAACAACTC CGCCCCATTG 
GAGGTCTATA TAAGCAGCGC GTTTTGCCTG 
AGCCTGGGAG CTCTCTGGCT AACTAGGGAA 
TTGAGTGCTT CAAGTAGTGT GTGCCCGTCT 
CAGACCCTTT TAGTCAGTGT GGAAAATCTC 
GCGAAAGGGA AACCAGAGGA GCTCTCTCGA 
GCAAGAGGCG AGGGGCGGCG ACTGGTGAGT 
AAGGAGAGAG ATGGGTG CG A GAGCGTCAGT 
GAAAAAATTC GGTTAAGGCC AGGGGGAAAG 
GCAAGCAGGG AGCTAGAACG ATTCGCAGTT 
TGTAGACAAA TAC TGGGAC A GCTACAACCA 
TCATTATATA ATACAGTAGC AACCCTCTAT 
ACCAAGGAAG CTTTAGACAA GATAGAGGAA 
CAAGCGGCCG GCCGCGCTGA TCTTCAGACC 
AAGTGAATTA TATAAATATA AAGTAGTAAA 
GGGAAAGAGA AGAGTGGTGC AGAGAGAAAA 
TGGGTTCTTG GGAGCAGCAG GAAGCACTAT 
GGCCAGACAA TTATTGTCTG GTATAGTGCA 
GGCGCAACAG CATCTGTTGC AACTCACAGT 
CCTGGCTGTG GAAAG AT AC C TAAAGGATCA 
AAAACTCATT TGCACCACTG CTGTGCCTTG 
ACAGATTTGG AATCACACGA CCTGGATGGA 
CTTAATACAC TCCTTAATTG AAGAATCGCA 
ATTGGAATTA GATAAATGGG CAAGTTTGTG 
GTATATAAAA TTATTCATAA TGATAGTAGG 
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2221 GGTTTAAGAA TAGTTTTTGC 
GGGATATTCA 

22 81 CCATTATCGT TTCAGACCCA 
CGAAGGAATA 
5 2 341 GAAGAAGAAG GTGGAGAGAG 

CGGATCGGCA 

24 01 CTGCGTGCGC CAATTCTGCA 
AAAGAAAAGG 

24 61 GGGGATTGGG GGGTACAGTG 
10 CAGACATACA 

2521 AACTAAAGAA TTACAAAAAC 
ATTACAGGGA 

25 81 CAGCAGAGAT CCAGTTTGGT 
CGCTAGCCGT 

15 2 641 TAATTAAGCC TCGAGGTCGA 

CAGCAGGTCG 

27 01 AGGGACCTAA TAACTTCGTA 
GTTCCAAGCT 

2 761 TAAGCGGCCG CGTGGATAAC 
20 GTAATCAATT 

2 821 ACGGGGTCAT TAGTTCATAG 
TACGGTAAAT 

2 8 81 GGCCCGCCTG GCTGACCGCC 
GACGTATGTT 

25 2 941 CCCATAGTAA CGCCAATAGG 

TTTACGGTAA 

3 0 01 ACTGCCCACT TGGCAGTACA 
TATTGACGTC 

3 061 AATGACGGTA AATGGCCCGC 

30 GGACTTTCCT 

3121 ACTTGGCAGT ACATCTACGT 
GTTTTGGCAG 

3181 TACATCAATG GGCGTGGATA 
CCACCCCATT 
35 3 241 GACGTCAATG GGAGTTTGTT 

ATGTCGTAAC 

3 3 01 AACTCCGCCC CATTGACGCA 
CTATATAAGC 

3 3 61 AGAGCTGGTT TAGTGAACCG 
40 ATGGTGAGCA 

3421 AGGGCGAGGA GCTGTTCACC 
GGCGACGTAA 

34 81 ACGGCCACAA GTTCAGCGTG 
GGCAAGCTGA 
45 3 541 CCCTGAAGTT CATCTGCACC 

CTCGTGACCA 

3 601 CCCTGACCTA CGGCGTGCAG 
CAGCACGACT 

3 661 TCTTCAAGTC CGCCATGCCC 
50 TTCAAGGACG 

3 721 ACGGCAACTA CAAGAGCCGC 
GTGAACCGCA 

3 781 TCGAGCTGAA GGGCATCGAC 
AAGCTGGAGT 
55 3 841 ACAACTACAA CAGCCACAAC 

GGCATCAAGG 

3 901 TGAACTTCAA GATCCGCCAC 
GACCACTACC 



TGTACTTTCT ATAGTGAATA GAGTTAGGCA 
CCTCCCAACC CCGAGGGGAC CCGACAGGCC 
AGACAGAGAC AGATCCATTC GATTAGTGAA 
GACAAATGGC AGTATTCATC CACAATTTTA 
CAGGGGAAAG AATAGTAGAC ATAATAGCAA 
AAAT T AC AAA AATT CAAAAT TTTCGGGTTT 
TAGTACCGGG CCCGCTCTAG ACGGTTAACG 
CGGTATCGAT AAGCTCGCTT C ACGAGATT C 
TAGCATACAT TATACGAAGT TATATTAAGG 
CGTATTACCG CCATGCATTA GTTATTAATA 
CCCATATATG GAGTTCCGCG TTACATAACT 
CAACGACCCC CGCCCATTGA CGTCAATAAT 
GACTTTCCAT TGACGTCAAT GGGTGGAGTA 
TCAAGTGTAT CATATGCCAA GTACGCCCCC 
CTGGCATTAT GCCCAGTACA TGACCTTATG 
ATTAGTCATC GCTATTACCA TGGTGATGCG 
GCGGTTTGAC TCACGGGGAT TTCCAAGTCT 
TTGGCACCAA AATCAACGGG ACTTTCCAAA 
AATGGGCGGT AGGCGTGTAC GGTGGGAGGT 
TCAGATCCGC TAGCGCTACC GGTCGCCACC 
GGGGTGGTGC CCATCCTGGT CGAGCTGGAC 
TCCGGCGAGG GCGAGGGCGA TGCCACCTAC 
ACCGGCAAGC TGCCCGTGCC CTGGCCCACC 
TGCTTCAGCC GCTACCCCGA CCACATGAAG 
GAAGGCTACG TCCAGGAGCG CACCATCTTC 
GCCGAGGTGA AGTTCGAGGG CGACACCCTG 
TTCAAGGAGG ACGGCAACAT CCTGGGGCAC 
GTCTATATCA TGGCCGACAA GCAGAAGAAC 
AACATCGAGG ACGGCAGCGT GCAGCTCGCC 
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3 961 AGCAGAACAC CCCCATCGGC 
TACCTGAGCA 

4 021 CCCAGTCCGC CCTGAGCAAA 
CTGCTGGAGT 

5 4 081 TCGTGACCGC CGCCGGGATC 

GAATTCGTCG 

4141 AGGGAC CTAA TAACTTCGTA 
TAAGGGTTCC 

42 01 GGTTCCACTA GGTACAATTC 
10 GATTACAAAA 

42 61 TTTGTGAAAG ATTGACTGGT 
TGTGGATACG 

43 21 CTGCTTTAAT GCCTTTGTAT 
TTCTCCTCCT 

15 43 81 TGTATAAATC CTGGTTGCTG 

AGGCAACGTG 

4441 GCGTGGTGTG CACTGTGTTT 
GCCACCACCT 

45 01 GTCAGCTCCT TTCCGGGACT 
20 GAACTCATCG 

45 61 CCGCCTGCCT TGCCCGCTGC 
AATTCCGTGG 

4 621 TGTTGTCGGG GAAATCATCG 
ACCTGGATTC 
25 4 6 81 TGCGCGGGAC GTCCTTCTGC 

CTTCCTTCCC 

4 741 GCGGCCTGCT GCCGGCTCTG 
CAGACGAGTC 

4 8 01 GGATCTCCCT TTGGGCCGCC 
30 AGAC CTAGAA 

4 8 61 AAACATGGAG CAATCACAAG 
TGCCTGGCTA 

4 921 GAAGCACAAG AGGAGGAGGA 
TTTAAGACCA 

35 4 981 ATG AC TTAC A AGGCAGCTGT 

GGGACTGGAA 

5 041 GGGCTAATTC ACTCCCAACG 
CCACACACAA 

5101 GGCTACTTCC CTGATTGGCA 
40 TCCACTGACC 

5161 TTTGGATGGT GCTACAAGCT 
AGCCAATGAA 

52 21 GGAGAGAACA CCCGCTTGTT 
CCCGGAGAGA 
45 52 81 GAAGTATTAG AGTGGAGGTT 

CCGAGAGCTG 

5 341 CATCCGGACT GTACTGGGTC 
GCTCTCTGGC 

54 01 TAACTAGGGA ACCCACTGCT 
50 TCAAGTAGTG 

54 61 TGTGCCCGTC TGTTGTGTGA 
TTAGTCAGTG 

5521 TGGAAAATCT CTAGCAGCAT 
CCGTAAAAAG 
55 5581 GCCGCGTTGC TGGCGTTTTT 

CAAAAATCGA 

5641 CGCTCAAGTC AGAGGTGGCG 
GTTTCCCCCT 



GACGGCCCCG TGCTGCTGCC CGACAACCAC 
GACCCCAACG AGAAGCGCGA TCACATGGTC 
ACTCTCGGCA TGGACGAGCT GTACAAGTAG 
TAGCATACAT TATACGAAGT TATACATGTT 
GATATCAAGC TTAT CGATAA TCAACCTCTG 
ATTCTTAACT ATGTTGCTCC TTTTACGCTA 
CATGCTATTG CTTCCCGTAT GGCTTTCATT 
TCTCTTTATG AGGAGTTGTG GCCCGTTGTC 
GCTGACGCAA CCCCCACTGG TTGGGGCATT 
TTCGCTTTCC CCCTCCCTAT TGC CACGGCG 
TGGACAGGGG CTCGGCTGTT GGGCACTGAC 
TCCTTTCCTT GGCTGCTCGC CTGTGTTGCC 
TACGTCCCTT CGGCCCTCAA TCCAGCGGAC 
CGGCCTCTTC CGCGTCTTCG CCTTCGCCCT 
TCCCCGCATC GATACCGTCG ACCTCGATCG 
TAGCAATACA GCAGCTACCA ATGCTGATTG 
GGTGGGTTTT CCAGTCACAC CTCAGGTACC 
AGATCTTAGC CACTTTTTAA AAGAAAAGGG 
AAGACAAGAT ATCCTTGATC TGTGGATCTA 
G AAC T AC AC A CCAGGGCCAG GGATCAGATA 
AGTACCAGTT GAGCAAGAGA AGGTAGAAGA 
ACACCCTGTG AGCCTGCATG GGATGGATGA 
TGACAGCCGC CTAGCATTTC ATCACATGGC 
TCTCTGGTTA GACCAGATCT GAGCCTGGGA 
TAAGCCTCAA TAAAGCTTGC CTTGAGTGCT 
CTCTGGTAAC TAGAGATCCC TCAGACCCTT 
GTGAGCAAAA GGCCAGCAAA AGGCCAGGAA 
CCATAGGCTC CGCCCCCCTG ACGAGCATCA 
AAACCCGACA GGACTATAAA GATACCAGGC 
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5 701 GGAAGCTCCC TCGTGCGCTC TCCTGTTCCG ACCCTGCCGC TTACCGGATA 
CCTGTCCGCC 

5 761 TTTCTCCCTT CGGGAAGCGT GGCGCTTTCT CATAGCTCAC GCTGTAGGTA 
TCTCAGTTCG 

5 5821 GTGTAGGTCG TTCGCTCCAA GCTGGGCTGT GTGCACGAAC CCCCCGTTCA 

GCCCGACCGC 

5881 TGCGCCTTAT CCGGTAACTA TCGTCTTGAG TCCAACCCGG TAAGACACGA 
CTTATCGCCA 

5 941 CTGGCAGCAG CCACTGGTAA CAGGATTAGC AGAG CGAGGT ATGTAGGCGG 
10 TGCTACAGAG 

60 01 TTCTTGAAGT GGTGGCCTAA CTACGGCTAC ACTAGAAGAA CAGTATTTGG 
TATCTGCGCT 

6 061 CTGCTGAAGC CAGTTACCTT CGGAAAAAGA GTTGGTAGCT CTTGATCCGG 
CAAACAAACC 

15 6121 ACCGCTGGTA GCGGTGGTTT TTTTGTTTGC AAGCAGCAGA TTACGCGCAG 

AAAAAAAGGA 

6181 TCTCAAGAAG ATCCTTTGAT CTTTTCTACG GGGTCTGACG CTCAGTGGAA 
CGAAAACTCA 

6241 CGTTAAGGGA TTTTGGTCAT GAGATTATCA AAAAGGATCT TCACCTAGAT 
20 CCTTTTAAAT 

63 01 TAAAAATGAA GTTTTAAATC AATCTAAAGT ATATATGAGT AAACTTGGTC 
TGACAGTTAC 

63 61 CAATGCTTAA TCAGTGAGGC ACCTATCTCA GCGATCTGTC TATTTCGTTC 
ATCCATAGTT 

25 6421 GCCTGACTCC CCGTCGTGTA GATAACTACG ATACGGGAGG GCTTACCATC 

TGGCCCCAGT 

64 81 GCTGCAATGA TACCGCGAGA CCCACGCTCA CCGGCTCCAG ATTTATCAGC 
AATAAACCAG 

6541 CCAGCCGGAA GGGCCGAGCG CAGAAGTGGT CCTGCAACTT TATCCGCCTC 
30 CATCCAGTCT 

6601 ATTAATTGTT GCCGGGAAGC TAGAGTAAGT AGTTCGCCAG TTAATAGTTT 
GCGCAACGTT 

6661 GTTGCCATTG CTACAGGCAT CGTGGTGTCA CGCTCGTCGT TTGGTATGGC 
TTCATTCAGC 

35 6721 TCCGGTTCCC AACGATCAAG GCGAGTTACA TGATCCCCCA TGTTGTGCAA 

AAAAGCGGTT 

67 81 AGCTCCTTCG GTCCTCCGAT CGTTGTCAGA AGTAAGTTGG CCGCAGTGTT 
ATCACTCATG 

6841 GTTATGGCAG CACTGCATAA TTCTCTTACT GTCATGCCAT CCGTAAGATG 
40 CTTTTCTGTG 

6901 ACTGGTGAGT ACTCAACCAA GTCATTCTGA GAATAGTGTA TGCGGCGACC 
GAGTTGCTCT 

6961 TGCCCGGCGT CAATACGGGA TAATACCGCG C C AC AT AG C A GAACTTTAAA 
AGTGCTCATC 

45 7 021 ATTGGAAAAC GTTCTTCGGG GCGAAAACTC TCAAGGATCT TACCGCTGTT 

GAGATCCAGT 

7081 TCGATGTAAC CCACTCGTGC AC C CAACTGA TCTTCAGCAT CTTTTACTTT 
CACCAGCGTT 

7141 TCTGGGTGAG CAAAAACAGG AAGGCAAAAT GCCGCAAAAA AGGGAATAAG 
50 GGCGACACGG 

72 01 AAATGTTGAA TACTCATACT CTTCCTTTTT CAATATTATT GAAGCATTTA 
TCAGGGTTAT 

72 61 TGTCTCATGA GCGGATACAT ATTTGAATGT ATTTAGAAAA ATAAACAAAT 
AGGGGTTCCG 

55 7321 CGCACATTTC CCCGAAAAGT GCCACCTGAC 

// 

(SEQ ID NO: 8) 
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[00352] pLL3.7 



PLL3 . 7 . GB 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



LOCUS 
2002 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES 

promoter 



misc_recomb 

promoter 

gene 

rep_origin 

misc_recomb 

LTR 

mi sc_f eature 

misc_feature 

misc_f eature 

LTR 

frag 

frag 

frag 

gene 

frag 

frag 

source 

promoter 
mi s c_f eature 
misc_f eature 
mi sc_f eature 
promoter 
to PSE or 



7 65 0 BP DS-DNA CIRCULAR SYN 



23-JAN- 



Location/Qualif iers 
212 . . 816 

/note="CMV promoter/enhancer 1" 

4450 . .4483 

/note="Lox 2" 

3099. .3687 

/note="CMV" 

6657 . .7517 

/note="AmpR" 

5839. . 6512 

/note="pUC" 

3010. .3045 

/note="Lox 1" 

835 . . 1509 

/note="5» HIV R-U5-del gag (HIV NL4 -3/454 - 1126) " 
1539. .2396 

/note="HIV RRE (HIV NL4-3/7622-8459) " 

2422 . .2599 

/note=»HIV Flap" 

4538 . .5127 

/note="WRE element" 

5147. .5836 

/note="3 ' SIN LTR" 

3072 . .4430 

/note="l to 1359 of Untitledl" 
3072 . .3098 

/note="4705 to 4731 of pEGFP-Cl" 
3099. .4427 

/note="l to 1329 of pEGFP-Cl" 
3704 . .4427 
/note="EGFP" 
2617 . .2950 

/note="l to 334 of Untitled^" 
2622 . .2935 

/note="l to 314 of mouseu6" 
2622 . . >2935 

/organism="Mus mus cuius" 
/db_xref="taxon: 10090" 
/clone="pmU6-52BE [Split] " 
2622 . . >2935 

/note="U6 Promoter [Split] » 
2648 . .2658 

/note="pot. SPI binding site" 
2692.. 2701 

/note="pot. SPI binding site" 
2707. .2714 

/note= "pot . enhancer " 
2869. .2888 

/note="pot. promoter region; sequence homologous 
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element 'B'" 
promoter 2906. .2911 

/note="put . TATA -box" 
BASE COUNT 2032 A 1861 C 1917 G 1840 T 0 OTHER 

5 ORIGIN 

1 GTCGACGGAT CGGGAGATCT CCCGATCCCC TATGGTGCAC TCTCAGTACA 
ATCTGCTCTG 

61 ATGCCGCATA GTTAA&CCAG TATCTGCTCC CTGCTTGTGT GTTGGAGGTC 
GCTGAGTAGT 

10 121 GCGCGAGCAA AATTTAAGCT ACAACAAGGC AAGGCTTGAC CGACAATTGC 

ATGAAGAATC 

181 TGCTTAGGGT TAGGCGTTTT GCGCTGCTTC GCGATGTACG GGC CAGATAT 
ACGGGTTGAC 

241 ATTGATTATT GAC TAGTTAT TAATAGTAAT CAATTACGGG GTCATTAGTT 
15 CATAGCCCAT 

3 01 ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA 
CCGCCCAACG 

361 ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT AGTAACGCCA 
ATAGGGACTT 

20 421 TCCATTGACG TCAATGGGTG GAGTATTTAC GGTAAACTGC CCACTTGGCA 

GTACATCAAG 

481 TGTATCATAT GCCAAGTACG CCCCCTATTG ACGTCAATGA CGGTAAATGG 
CCCGCCTGGC 

541 ATTATGCCCA GTACATGACC TTATGGGACT TTCCTACTTG GCAGTACATC 
25 TACGTATTAG 

601 TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT CAATGGGCGT 
GGATAGCGGT 

661 TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT CAATGGGAGT 
TTGTTTTGGC 

30 721 AC CAAAATCA ACGGGACTTT CCAAAATGTC GTAACAACTC CGCCCCATTG 

ACGCAAATGG 

781 GCGGTAGGCG TGTACGGTGG GAGGTCTATA TAAGCAGCGC GTTTTGCCTG 
TACTGGGTCT 

841 CTCTGGTTAG ACCAGATCTG AGCCTGGGAG CTCTCTGGCT AACTAGGGAA 
35 CCCACTGCTT 

901 AAGCCTCAAT AAAGCTTGCC TTGAGTGCTT CAAGTAGTGT GTGCCCGTCT 
GTTGTGTGAC 

961 TCTGGTAACT AGAGATCCCT CAGACCCTTT TAGTCAGTGT GGAAAATCTC 
TAGCAGTGGC 

40 1021 GCCCGAACAG GGACTTGAAA GCGAAAGGGA AACCAGAGGA GCTCTCTCGA 

CGCAGGACTC 

1081 GGCTTGCTGA AGCGCGCACG GCAAGAGGCG AGGGGCGGCG ACTGGTGAGT 
ACGCCAAAAA 

1141 TTTTGACTAG CGGAGGCTAG AAGGAGAGAG ATGGGTGCGA GAGCGTCAGT 
45 ATTAAGCGGG 

12 01 GGAGAATTAG ATCGCGATGG GAAAAAATTC GGTTAAGGCC AGGGGGAAAG 
AAAAAATATA 

12 61 AATTAAAACA TATAGTATGG GCAAGCAGGG AGCTAGAACG ATTCGCAGTT 
AATCCTGGCC 

50 1321 TGTTAGAAAC ATCAGAAGGC TGTAGACAAA TACTGGGACA GCTACAACCA 

TCCCTTCAGA 

13 81 CAGGATCAGA AGAACTTAGA TCATTATATA ATACAGTAGC AACCCTCTAT 
TGTGTGCATC 

1441 AAAGGATAGA GATAAAAGAC AC CAAGGAAG CTTTAGACAA GATAGAGGAA 
55 GAGCAAAACA 

15 01 AAAGTAAGAC CACCGCACAG CAAGCGGCCG GCCGCGCTGA TCTTCAGACC 
TGGAGGAGGA 

1561 GATATGAGGG ACAATTGGAG AAGTGAATTA TATAAATATA AAGTAGTAAA 
AATTGAACCA 
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1621 TTAGGAGTAG CACCCACCAA 
AAGAGCAGTG 

1681 GGAATAGGAG CTTTGTTCCT 
GGGCGCAGCG 
5 1741 TCAATGACGC TGACGGTACA 

GCAGCAGAAC 

18 01 AATTTGCTGA GGGCTATTGA 
CTGGGGCATC 

18 61 AAGCAGCTCC AGGCAAGAAT 
10 ACAGCTCCTG 

1921 GGGATTTGGG GTTGCTCTGG 
GAATGCTAGT 

19 81 TGGAGTAATA AATCTCTGGA 
GTGGGACAGA 

15 2 041 GAAATTAACA ATTACACAAG 

AAACCAGCAA 

2101 GAAAAGAATG AACAAGAATT 
GAATTGGTTT 

2161 AACATAACAA ATTGGCTGTG 
20 AGGCTTGGTA 

2 221 GGTTTAAGAA TAGTTTTTGC 
GGGATATTCA 

22 81 CCATTATCGT TTCAGACCCA 
CGAAGGAATA 
25 2341 GAAGAAGAAG GTGGAGAGAG 

CGGATCGGCA 

24 01 CTGCGTGCGC CAATTCTGCA 
AAAGAAAAGG 

24 61 GGGGATTGGG GGGTACAGTG 
30 C AG AC AT AC A 

2 521 AACTAAAGAA TTACAAAAAC 
ATTACAGGGA 

25 81 CAGCAGAGAT CCAGTTTGGT 
CCGCCATCTC 

35 2 641 TAGGCCCGCG CCGGCCCCCT 

TCCCCTGCCC 

2 701 CGGTTAATTT GCATATAATA 
GATAAAAGAC 

2 761 AGATAATCTG TTCTTTTTAA 
40 ATTTCTATAA 

2 821 GAGATACAAA TACTAAATTA 
CACCCTAACT 

2 8 81 GTAAAGTAAT TGTGTGTTTT 
TTGTTAACGC 

45 2 941 GCGGTGACCC TCGAGGTCGA 

CAGCAGGTCG 

3 0 01 AGGGACCTAA TAACTTCGTA 
GTTCCAAGCT 

3 0 61 TAAGCGGCCG CGTGGATAAC 
50 GTAATCAATT 

3121 ACGGGGTCAT TAGTTCATAG 
TACGGTAAAT 

3181 GGCCCGCCTG GCTGACCGCC 
GACGTATGTT 
55 3241 CCCATAGTAA CGC CAATAGG 

TTTACGGTAA 

33 01 ACTGCCCACT TGGCAGTACA 
TATTGACGTC 



GGCAAAGAGA AGAGTGGTGC AGAGAGAAAA 
TGGGTTCTTG GGAGCAGCAG GAAGCACTAT 
GGCCAGACAA TTATTGTCTG GTATAGTGCA 
GGCGCAACAG CATCTGTTGC AACTCACAGT 
CCTGGCTGTG G AAAG AT AC C TAAAGGAT C A 
AAAACTCATT TGCACCACTG CTGTGCCTTG 
ACAGATTTGG AATCACACGA CCTGGATGGA 
CTTAATACAC TCCTTAATTG AAGAATCGCA 
ATTGGAATTA GATAAATGGG CAAGTTTGTG 
GTATATAAAA TTATTCATAA TGATAGTAGG 
TGTACTTTCT ATAGTGAATA GAGTTAGGCA 
CCTCCCAACC CCGAGGGGAC CCGACAGGCC 
AGACAGAGAC AGATCCATTC GATTAGTGAA 
GACAAATGGC AGTATTCATC CACAATTTTA 
CAGGGGAAAG AATAGTAGAC ATAATAGCAA 
AAATTACAAA AATTCAAAAT TTTCGGGTTT 
TAGT AC CGGG CCCGCTCTAG AGATCCGACG 
CGCACAGACT TGTGGGAGAA GCTCGGCTAC 
TTTCCTAGTA ACTATAGAGG CTTAATGTGC 
TAC TAGCTAC ATTTTACATG ATAGGCTTGG 
TTATTTTAAA AAACAGCACA AAAGGAAACT 
GAGACTATAA ATATCCCTTG GAGAAAAGCC 
CGGTATCGAT AAGCTCGCTT CACGAGATTC 
TAGCATACAT TATACGAAGT TATATTAAGG 
CGTATTACCG CCATGCATTA GTTATTAATA 
CCCATATATG GAGTTCCGCG TTACATAACT 
CAACGACCCC CGCCCATTGA CGTCAATAAT 
GACTTTCCAT TGACGTCAAT GGGTGGAGTA 
TCAAGTGTAT CATATGCCAA GTACGCCCCC 
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33 61 AATGACGGTA 
GGACTTTCCT 

3421 ACTTGGCAGT 
GTTTTGGCAG 

34 81 TACATCAATG 
CCACCCCATT 

3541 GACGTCAATG 
ATGTCGTAAC 

36 01 AACTCCGCCC 
CTATATAAGC 

3661 AGAGCTGGTT 
ATGGTGAGCA 

3 721 AGGGCGAGGA 
GGCGACGTAA 

37 81 ACGGCCACAA 
GGCAAGCTGA 

3 841 CCCTGAAGTT 
CTCGTGACCA 

39 01 CCCTGACCTA 
CAGCACGACT 

3 961 TCTTCAAGTC 
TTCAAGGACG 

4 021 ACGGCAACTA 
GTGAACCGCA 

40 81 TCGAGCTGAA 
AAGCTGGAGT 

4141 ACAACTACAA 
GGCATCAAGG 

42 01 TGAACTTCAA 
GACCACTACC 

42 61 AGCAGAACAC 
TAC CTGAGCA 

4321 CCCAGTCCGC 
CTGCTGGAGT 

43 81 TCGTGACCGC 
GAATTCGTCG 

4441 AGGGACCTAA 
TAAGGGTTCC 

45 01 GGTTCCACTA 
GATTACAAAA 

4561 TTTGTGAAAG 
TGTGGATACG 

4 621 CTGCTTTAAT 
TTCTCCTCCT 

4681 TGTATAAATC 
AGGCAACGTG 

4 741 GCGTGGTGTG 
GCCACCACCT 

4801 GTCAGCTCCT 
GAACTCATGG 

4861 CCGCCTGCCT 
AATTCCGTGG 

4921 TGTTGTCGGG 
ACCTGGATTC 

4981 TGCGCGGGAC 
CTTCCTTCCC 

5 041 GCGGCCTGCT 
CAGACGAGTC 



AATGGCCCGC CTGGCATTAT 
ACATCTACGT ATTAGTCATC 
GGCGTGGATA GCGGTTTGAC 
GGAGTTTGTT TTGGCACCAA 
CATTGACGCA AATGGGC GGT 
TAGTGAACCG TCAGATCCGC 
GCTGTTCACC GGGGTGGTGC 
GTTCAGCGTG TCCGGCGAGG 
CATCTGCACC ACCGGCAAGC 
CGGCGTGCAG TGCTTCAGCC 
CGCCATGCCC GAAGGCTACG 
CAAGACCCGC GCCGAGGTGA 
GGGCATCGAC TTCAAGGAGG 
CAGCCACAAC GTCTATATCA 
GATCCGCCAC AACATCGAGG 
CCCCATCGGC GACGGCCCCG 
CCTGAGCAAA GACCCCAACG 
CGCCGGGATC ACTCTCGGCA 
TAACTTCGTA TAGCATACAT 
GGTACAATTC GATATCAAGG 
ATTGACTGGT ATTCTTAACT 
GC CTTTGTAT CATGCTATTG 
CTGGTTGCTG TCTCTTTATG 
CACTGTGTTT GCTGACGCAA 
TTCCGGGACT TTCGCTTTCC 
TGCCCGCTGC TGGACAGGGG 
GAAATCATCG TCCTTTCCTT 
GTCCTTCTGC TACGTCCCTT 
GCCGGCTCTG CGGCCTCTTC 



GCCCAGTACA TGACCTTATG 
GCTATTACCA TGGTGATGCG 
TCACGGGGAT TTCCAAGTCT 
AATCAACGGG ACTTTCCAAA 
AGGCGTGTAC GGTGGGAGGT 
TAGCGCTACC GGTCGCCACC 
CCATCCTGGT CGAGCTGGAC 
GCGAGGGCGA TGCCACCTAC 
TGCCCGTGCC CTGGCCCACC 
GCTACCCCGA CCACATGAAG 
TCCAGGAGCG CACCATCTTC 
AGTTCGAGGG CGACACCCTG 
ACGGCAACAT CCTGGGGCAC 
TGGCCGACAA GCAGAAGAAC 
ACGGCAGCGT GCAGCTCGCC 
TGCTGCTGCC CGACAACCAC 
AGAAGCGCGA TCACATGGTC 
TGGACGAGCT GTACAAGTAG 
TATACGAAGT TATACATGTT 
TTATCGATAA TCAACCTCTG 
ATGTTGCTCC TTTTACGCTA 
CTTCCCGTAT GGCTTTCATT 
AGGAGTTGTG GCCCGTTGTC 
CCCCCACTGG TTGGGGCATT 
CCCTCCCTAT TGCCACGGCG 
CTCGGCTGTT GGGCACTGAC 
GGCTGCTCGC CTGTGTTGCC 
CGGCCCTCAA TCCAGCGGAC 
CGCGTCTTCG CCTTCGCCCT 
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5101 GGATCTCCCT 
AGACCTAGAA 

5161 AAACATGGAG 
TGCCTGGCTA 

52 21 GAAGCACAAG 
TTTAAGACCA 

52 81 ATGACTTACA 
GGGAC TGGAA 

5341 GGGCTAATTC 
CCACACACAA 

5401 GGCTACTTCC 
TCCACTGACC 

5461 TTTGGATGGT 
AGC CAATGAA 

5521 GGAGAGAACA 
CCCGGAGAGA 

55 81 GAAGTATTAG 
CCGAGAGCTG 

5641 CATCCGGACT 
GCTCTCTGGC 

57 01 TAACTAGGGA 
TCAAGTAGTG 

5761 TGTGCCCGTC 
TTAGTCAGTG 

5821 TGGAAAATCT 
CCGTAAAAAG 

58 81 GCCGCGTTGC 
CAAAAATCGA 

5941 CGCTCAAGTC 
GTTTCCCCCT 

6001 GGAAGCTCCC 
CCTGTCCGCC 

6061 TTTCTCCCTT 
TCTCAGTTCG 

6121 GTGTAGGTCG 
GCCCGACCGC 

6181 TGCGCCTTAT 
CTTATCGCCA 

6241 CTGGCAGCAG 
TGCTACAGAG 

63 01 TTCTTGAAGT 
TATCTGCGCT 

63 61 CTGCTGAAGC 
CAAACAAACC 

6421 ACCGCTGGTA 
AAAAAAAGGA 

64 81 TCTCAAGAAG 
CGAAAACTCA 

6541 CGTTAAGGGA 
CCTTTTAAAT 

6601 TAAAAATGAA 
TGACAGTTAC 

6661 CAATGCTTAA 
ATCCATAGTT 

6721 GCCTGACTCC 
TGGCCCCAGT 

67 81 GCTGCAATGA 
AATAAACCAG 



TTGGGCCGCC TCCCCGCATC 
CAATCACAAG TAGCAATACA 
AGGAGGAGGA GGTGGGTTTT 
AGGCAGCTGT AGATCTTAGC 
AGTCCCAACG AAGACAAGAT 
CTGATTGGCA GAACTACACA 
GCTACAAGCT AGTACCAGTT 
CCCGCTTGTT ACACCCTGTG 
AGTGGAGGTT TGACAGCCGC 
GTACTGGGTC TCTCTGGTTA 
ACCCACTGCT TAAGCCTCAA 
TGTTGTGTGA CTCTGGTAAC 
CTAGCAGCAT GTGAGCAAAA 
TGGCGTTTTT CCATAGGCTC 
AGAGGTGGCG AAAC CCGAC A 
TCGTGCGCTC TCCTGTTCCG 
CGGGAAGCGT GGCGCTTTCT 
TTCGCTCCAA GCTGGGCTGT 
CCGGTAACTA TCGTCTTGAG 
CCACTGGTAA CAGGATTAGC 
GGTGGCCTAA CTACGGCTAC 
CAGTTACCTT CGGAAAAAGA 
GCGGTGGTTT TTTTGTTTGC 
ATCCTTTGAT CTTTTCTACG 
TTTTGGTCAT GAGATTATCA 
GTTTTAAATC AATCTAAAGT 
TCAGTGAGGC ACCTATCTCA 
CCGTCGTGTA GATAACTACG 
TACCGCGAGA CCCACGCTCA 



GATAC CGTCG ACCTCGATCG 
GCAGCTACCA ATGCTGATTG 
CCAGTCACAC CTCAGGTACC 
CACTTTTTAA AAGAAAAGGG 
ATCCTTGATC TGTGGATCTA 
CCAGGGCCAG GGATCAGATA 
GAGCAAGAGA AGGTAGAAGA 
AGCCTGCATG GGATGGATGA 
CTAGCATTTC ATCACATGGC 
GACCAGATCT GAGCCTGGGA 
TAAAGCTTGC CTTGAGTGCT 
TAGAGATCCC TCAGACCCTT 
GGCCAGCAAA AGGCCAGGAA 
CGCCCCCCTG ACGAGCATCA 
GGAC TAT AAA GATAC C AGGC 
ACCCTGCCGC TTACCGGATA 
CATAGCTCAC GCTGTAGGTA 
GTGCACGAAC CCCCCGTTCA 
TCCAACCCGG TAAGACACGA 
AGAGCGAGGT ATGTAGGCGG 
ACTAGAAGAA CAGTATTTGG 
GTTGGTAGCT CTTGATCCGG 
AAGCAGCAGA TTACGCGCAG 
GGGTCTGACG CTCAGTGGAA 
AAAAGGATCT TCAC CTAGAT 
ATATATGAGT AAACTTGGTC 
GCGATCTGTC TATTTCGTTC 
ATACGGGAGG GCTTACCATC 
CCGGCTCCAG ATTTATCAGC 
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6841 CCAGCCGGAA GGGCCGAGCG 
CATCCAGTCT 

6901 ATTAATTGTT GCCGGGAAGC 
GCGCAACGTT 
5 6961 GTTGCCATTG CTACAGGCAT 

TTCATTCAGC 

7 021 TCCGGTTCCC AACGATCAAG 
AAAAGCGGTT 

7081 AGCTCCTTCG GTCCTCCGAT 
10 ATCACTCATG 

7141 GTTATGGCAG CACTGCATAA 
CTTTTCTGTG 

72 01 AGTGGTGAGT ACTCAACCAA 
GAGTTGCTCT 

15 72 61 TGCCCGGCGT CAATACGGGA 

AGTGCTCATC 

73 21 ATTGGAAAAC GTTCTTCGGG 
GAGATCCAGT 

73 81 TCGATGTAAC CCACTCGTGC 
20 CACCAGCGTT 

7441 TCTGGGTGAG CAAAAACAGG 
GGCGACACGG 

75 01 AAATGTTGAA TACTCATACT 
TCAGGGTTAT 
25 7561 TGTCTCATGA GCGGATACAT 

AGGGGTTCCG 

7 621 CGCACATTTC CCCGAAAAGT 

// 

30 (SEQ ID NO: 9) 



CAGAAGTGGT CCTGCAACTT TATCCGCCTC 
TAGAGTAAGT AGTTCGCCAG TTAATAGTTT 
CGTGGTGTCA CGCTCGTCGT TTGGTATGGC 
GCGAGTTACA TGATCCCCCA TGTTGTGCAA 
CGTTGTCAGA AGTAAGTTGG CCGCAGTGTT 
TTCTCTTACT GTCATGCCAT CCGTAAGATG 
GTCATTCTGA GAATAGTGTA TGCGGCGACC 
TAATACCGCG C C AC ATAGC A GAAC TTTAAA 
GCGAAAACTC TCAAGGATCT TACCGCTGTT 
ACCCAACTGA TCTTCAGCAT CTTTTACTTT 
AAGGCAAAAT GCCGCAAAAA AGGGAATAAG 
CTTCCTTTTT CAATATTATT GAAGCATTTA 
ATTTGAATGT ATTTAGAAAA ATAAACAAAT 
GCCACCTGAC 



35 
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Equivalents 

20 [00353] Those skilled in the art will recognize, or be able to ascertain using no 

more than routine experimentation, many equivalents to the specific embodiments of 
the invention described herein. The Examples below are provided to illustrate the 
invention and are not limiting. Alternative procedures known to one of ordinary skill 
in the art might also be used. The scope of the present invention is not intended to be 

25 limited to the above Description, but rather is as set forth in the appended claims. 
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1 We claim: 

2 1. A lentiviral vector comprising the following elements: a nucleic acid whose 

3 sequence includes (i) a functional packaging signal; (ii) a multiple cloning site 

4 (MCS); and (iii) at least one additional element selected from the group 

5 consisting of: a second MCS, a second MCS into which a heterologous nucleic 

6 acid is inserted, an HIV FLAP element, an expression-enhancing 

7 posttranscriptional regulatory element, a target site for a site-specific 

8 recombinase, and a self-inactivating (SIN) LTR, wherein the lentiviral vector 

9 is a lentiviral transfer plasmid or an infectious lentiviral particle. 

10 2. The lentiviral vector of claim 1, wherein the vector comprises at least two 

1 1 elements selected from the group consisting of: a second MCS, a second MCS 

12 into which a heterologous nucleic acid is inserted, an HIV FLAP element, an 

13 expression-enhancing posttranscriptional regulatory element, a target site for a 

14 site-specific recombinase, and a self-inactivating (SIN) LTR. 

15 3. The lentiviral vector of claim 1, wherein the vector comprises at least three 

16 elements selected from the group consisting of: a second MCS, a second MCS 

17 into which a heterologous nucleic acid is inserted, an HIV FLAP element, an 

18 expression-enhancing posttranscriptional regulatory element, a target site for a 

19 site-specific recombinase, and a self-inactivating (SIN) LTR. 

20 4. The lentiviral vector of claim 1 , wherein the vector comprises at least four 

21 elements selected from the group consisting of: a second MCS, a second MCS 

22 into which a heterologous nucleic acid is inserted, an HIV FLAP element, an 

23 expression-enhancing posttranscriptional regulatory element, a target site for a 

24 site-specific recombinase, and a self-inactivating (SIN) LTR. 

25 5 . The lentiviral vector of claim 1 , wherein the vector comprises a second MCS, 

26 an HIV FLAP element, an expression-enhancing posttranscriptional regulatory 

27 element, a target site for a site-specific recombinase, and a self-inactivating 

28 (SIN) LTR. 
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1 6. The lentiviral vector of claim 1 , wherein the vector comprises a second MCS 

2 into which a heterologous nucleic acid is inserted, an HIV FLAP element, an 

3 expression-enhancing posttranscriptional regulatory element, a target site for a 

4 site-specific recombinase, and a self-inactivating (SIN) LTR. 

5 7. The lentiviral vector of claim 1 , wherein the additional element is a second 

6 MCS. 

7 8. The lentiviral vector of claim 1, wherein the additional element is a second 

8 MCS into which a heterologous nucleic acid is inserted. 

9 9. The lentiviral vector of claim 1 , wherein the vector has unique restriction sites 

10 for at least 4 enzymes selected from the group consisting of NotI, Apal, Xhol, 

1 1 Xbal, Hpal, Nhel, Pad, Nsil, SphI, Sma/Xma, AccI, BamHI, and SphL 

12 10. The lentiviral vector of claim 1 , wherein the vector has unique restriction sites 

13 for at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 1 1, at 

14 least 12, or at least 13 enzymes selected from the group consisting of NotI, 

15 Apal, Xhol, Xbal, Hpal, Nhel, Pad, Nsil, SphI, Sma/Xma, AccI, BamHI, and 

16 SphI. 

17 11. The lentiviral vector of claim 1 , wherein the additional element is an HIV 

18 FLAP element. 

19 12. The lentiviral vector of claim 1 , wherein the additional element is an 

20 expression-enhancing posttranscriptional regulatory element. 

21 13. The lentiviral vector of claim 1 2, wherein the expression-enhancing 

22 posttranscriptional regulatory element is a WRE. 

23 14. The lentiviral vector of claim 1, wherein the additional element is a target site 

24 for a site-specific recombinase. 

25 15. The lentiviral vector of claim 14, wherein the site is a loxP site. 
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1 16. The lentiviral vector of claim 1, wherein the lentiviral vector is a lentiviral 

2 transfer plasmid. 

3 17. The lentiviral transfer plasmid of claim 16, wherein the plasmid has a size of 

4 less than 10 kB. 

5 18. The lentiviral transfer plasmid of claim 16, wherein the plamid has a size of 

6 less than 9 kB. 

7 19. The lentiviral transfer plasmid of claim 16, wherein the plasmid has a size of 

8 less than 8 kB. 

9 20. The lentiviral transfer plasmid of claim 16, wherein the plasmid has a size of 

10 less than 7 kB. 

11 21 . The lentiviral transfer plasmid of claim 16, wherein the plasmid has a size of 

1 2 approximately 6 kB . 

13 22. The lentiviral vector of claim 1, wherein the lentiviral vector is an infectious 

1 4 lentiviral particle. 

15 23. The lentiviral vector of claim 1, further comprising: a heterologous promoter 

16 or promoter-enhancer. 

1 7 24. The lentiviral vector of claim 23, wherein the heterologous promoter or 

18 promoter-enhancer is selected from the group consisting of: the CMV 

19 promoter, the CMV promoter-enhancer, and the ubiquitin C promoter. 

20 25. The lentiviral vector of claim 24, wherein the heterologous promoter is an 

2 1 inducible promoter. 

22 26. The lentiviral vector of claim 24, wherein the heterologous promoter is a cell 

23 type specific or tissue specific promoter. 

24 27. The lentiviral vector of claim 23, wherein the heterologous promoter is an 

25 RNA polymerase promoter. 
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1 28. The lentiviral vector of claim 27, wherein the RNA polymerase promoter is an 

2 RNA polymerase III promoter. 

3 29. The lentiviral vector of claim 28, wherein the RNA polymerase III promoter is 

4 a U6 promoter. 

5 30. The lentiviral vector of claim 28, wherein the RNA polymerase III promoter is 

6 an HI promoter. 

7 31. The lentiviral vector of claim 27, wherein the RNA polymerase promoter is an 

8 RNA polymerase II promoter. 

9 32. The lentiviral vector of claim 23, further comprising a second heterologous 

1 0 promoter or promoter-enhancer. 

11 33. The lentiviral vector of claim 1 , further comprising a heterologous nucleic acid 

12 encoding a selectable marker operably linked to a promoter. 

13 34. The lentiviral vector of claim 1, further comprising a heterologous nucleic acid 

14 encoding a reporter molecule operably linked to a promoter. 

15 35. The lentiviral vector of claim 34, wherein the reporter molecule is selected 

16 from the group consisting of: GFP, EGFP, dsRed, dsRed2, cyan fluorescent 

17 protein, yellow fluorescent protein, blue fluorescent protein, dsRed, dsRed2 5 

1 8 luciferase, and aequorin. 

19 36. The lentiviral vector of claim 34, further comprising an RNA polymerase 

20 promoter. 

2 1 37. The lentiviral vector of claim 36, wherein the RNA polymerase promoter is an 

22 RNA polymerase III promoter. 

23 38. The lentiviral vector of claim 1 , wherein the lentiviral vector is a transfer 

24 plasmid, further comprising a genetic element sufficient for stable 

25 maintenance of the transfer plasmid as an episome within mammalian cells. 

26 39. A lentiviral vector comprising an RNA polymerase III promoter. 
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1 40. The lenti viral vector of claim 39, wherein the RNA polymerase III promoter is 

2 a U6 promoter. 

3 41 . The lentiviral vector of claim 39, wherein the RNA polymerase III promoter is 

4 an HI promoter. 

5 42. The lentiviral vector of claim 39, further comprising a heterologous nucleic 

6 acid encoding a reporter molecule. 

7 43. A lentiviral vector having a sequence as set forth in SEQ ID NO: 2, SEQ ID 

8 NO: 3, SEQ ID NO: 4, SEQ ED NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ 

9 ID NO: 8, or SEQ ID NO: 9. 

10 44. A collection of at least two of the lentiviral vectors of claim 43 . 

11 45. The collection of claim 44, wherein the collection includes a vector 

12 comprising a first heterologous promoter element and a vector comprising a 

13 second heterologous promoter element different from the first promoter 

14 element. 

15 46. The collection of claim 44, wherein the collection includes a vector 

16 comprising a first heterologous reporter gene and a vector comprising a 

17 second reporter gene different from the first reporter gene. 



18 47. A lentiviral vector having a sequence that differs by not more than 1 00 

19 nucleotides from the sequence set forth in SEQ ID NO: 2, SEQ ID NO: 3, 

20 SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 

21 8, or SEQ ID NO: 9. 

22 48. A collection of at least two of the lentiviral vectors of claim 47. 

23 49. The collection of claim 47, wherein the collection includes a vector 

24 comprising a first heterologous promoter element and a vector comprising a 

25 second heterologous promoter element different from the first promoter 

26 element. 
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1 50. The collection of claim 47, wherein the collection includes a vector 

2 comprising a first heterologous reporter gene and a vector comprising a 

3 second reporter gene different from the first reporter gene. 

4 51. A lentiviral vector having a sequence that differs by not more than X 

5 nucleotides from the sequence set forth in SEQ ID NO: 2, SEQ ID NO: 3, 

6 SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 

7 8, or SEQ ID NO: 9, where X represents any number between 1 and 99, 

8 inclusive. 

9 52. A collection of at least two of the lentiviral vectors of claim 5 1 . 

10 53 . A collection of at least two of the lentiviral vectors of claim 5 1 . 

11 54. The collection of claim 53, wherein the collection includes a vector 

12 comprising a first heterologous promoter element and a vector comprising a 

13 second heterologous promoter element different from the first promoter 

14 element. 

15 55 . The collection of claim 53, wherein the collection includes a vector 

16 comprising a first heterologous reporter gene and a vector comprising a 

17 second reporter gene different from the first reporter gene. 

18 56. A three-plasmid lentiviral expression system comprising: 

19 (a) a first plasmid whose sequence comprises a nucleic acid sequence 

20 of at least part of a lentiviral genome, wherein the plasmid (i) contains at least 

21 one defect in at least one gene encoding a lentiviral structural protein, and (ii) 

22 lacks a functional packaging signal; 

23 (b) a second plasmid whose sequence comprises a nucleic acid 

24 sequence of a virus, wherein the plasmid (i) expresses a viral envelope protein, 

25 and (ii) lacks a functional packaging signal; and 

26 (c) a third plasmid whose nucleic acid sequence includes (i) a 

27 functional packaging signal; (ii) a multiple cloning site (MCS); and (iii) at 

28 least one additional element selected from the group consisting of: a second 

29 MCS, a second MCS into which a heterologous nucleic acid is inserted; an 
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1 HIV FLAP element, an expression-enhancing posttranscriptional regulatory 

2 element, a target site for a site-specific recombinase, and a self-inactivating 

3 (SIN) LTR. 

4 57. A four plasmid lentiviral expression system comprising the three plasmid 

5 lentiviral expression system of claim 56, further comprising a fourth plasmid 

6 comprising a nucleic acid segment that encodes Rev, operably linked to a 

7 promoter. 

8 58. A cell comprising the lentiviral vector of claim 1 . 

9 59. The cell of claim 58, wherein the cell comprises a nucleic acid or nucleic acids 

10 having sequences encoding Gag, Pol, and Env proteins. 

11 60. A cell comprising a pro virus derived from the lentiviral vector of claim 1 . 

12 61 . A transgenic animal, at least some of whose cells contain the lentiviral vector 

13 of claim 1. 

14 62. A transgenic animal, at least some of whose cells contain a provirus derived 

15 from the lentiviral vector of claim 1 . 

16 63. A method of creating a producer cell line comprising introducing the lentiviral 

17 vector of claim 1 into a host cell, wherein the lentiviral vector is a transfer 

18 plasmid; and introducing a packaging plasmid and an envelope plasmid into 

19 the host cell. 

20 64. A method of producing lentiviral particles comprising 

21 (i) introducing the lentiviral vector of claim 1 into a helper cell, 

22 wherein the lentiviral vector is a transfer plasmid comprising a genetic 

23 element sufficient for stable maintenance of the plasmid as an episome in 

24 mammalian cells, into a helper cell that expresses proteins required for 

25 production of infectious lentiviral particles; and 

26 (ii) culturing the cell for a period sufficient to allow production of 

27 lentiviral particles. 
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1 65 . A method of producing lentiviral particles comprising 

2 (i) introducing the lentiviral vector of claim 1, which lentiviral vector 

3 is a lentiviral transfer plasmid comprising a genetic element sufficient for 

4 stable maintenance of the transfer plasmid as an episome in mammalian cells, 

5 into a helper cell that expresses a protein required for production of lentiviral 

6 particles, wherein expression of the protein is under control of an inducible 

7 promoter; 

8 (ii) inducing expression of the protein required for production of 

9 lentiviral particles; and 

10 (iii) culturing the cell for a period sufficient to allow production of 

1 1 lentiviral particles. 

12 66. A method of expressing a heterologous nucleic acid in a target cell comprising 

13 introducing a lentiviral vector of claim 1 into the target cell, wherein 

14 the lentiviral vector comprises a heterologous nucleic acid operably linked to a 

15 promoter; and 

1 6 expressing the heterologous nucleic acid therein. 

17 67. A method for achieving controlled expression of a heterologous nucleic acid 

18 in a cell comprising steps of: 

19 (i) providing a modified lentiviral vector comprising a heterologous 

20 nucleic acid inserted between sites for a recombinase; 

21 (ii) introducing the modified lentiviral vector or a portion thereof 

22 including at least the sites for the recombinase and the region between the sites 

23 into the cell and; 

24 (iii) subsequently inducing expression of the recombinase within the 

25 cell, thereby preventing expression of the heterologous nucleic acid within the 

26 cell. 

27 68. The method of claim 67, wherein the providing step comprises inserting the 

28 heterologous nucleic acid into a lentiviral vector between sites for a 

29 recombinase, thereby producing a modified lentiviral vector. 

30 69. The method of claim 67, wherein the cell is a mammalian cell. 
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1 70. A method for expressing a transcript in a mammal in a cell type or tissue- 

2 specific manner comprising: 

3 (i) delivering a lentiviral vector to cells of the mammal, wherein the 

4 lentiviral vector comprises a heterologous nucleic acid operably linked to a 

5 promoter so that transcription from the promoter results in synthesis of the 

6 transcript, and wherein the heterologous nucleic acid is located between sites 

7 for a site-specific recombinase; and 

8 (ii) inducing expression of the site-specific recombinase in a subset of 

9 the cells of the mammal, thereby preventing synthesis of the transcript within 

10 those cells. 

11 71. The method of claim 67 or claim 70, wherein the step of inducing the site- 

12 specific recombinase comprises introducing a vector encoding the site-specific 

13 recombinase into the cell. 

14 72. The method of claim 67 or claim 70, wherein expression of the site-specific 

15 recombinase is under control of a cell type specific or tissue specific promoter. 

16 73. The method of claim 67 or claim 70, wherein the sites are loxP sites and the 

17 site-specific recombinase is loxP. 

18 74. A lentiviral vector whose presence within a cell results in transcription of one 

19 or more ribonucleic acids (RNAs) that self-hybridize or hybridize to each 

20 other to form a short hairpin RNA or short interfering RNA that inhibits 

21 expression of at least one target transcript in the cell. 

22 75. The lentiviral vector of claim 74, wherein the vector provides a template for 

23 synthesis of an RNA that self-hybridizes to form an shRNA that is targeted to 

24 the transcript. 

25 76. The lentiviral vector of claim 75, wherein the shRNA comprises a loop having 

26 a sequence set forth in SEQ ID NO: 10. 
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1 77. The lenti viral vector of claim 74, wherein the vector provides a template for 

2 synthesis of complementary RNAs that hybridize with each other to form an 

3 siRNA that is targeted to the transcript. 

4 78. The lentiviral vector of claim 74, wherein the vector comprises a nucleic acid 

5 segment operably linked to a promoter, so that transcription from the promoter 

6 results in synthesis of one or more RNAs that self-hybridize or hybridize with 

7 each other to form an shRNA or siRNA targeted to the transcript. 

8 79. The composition of claim 74, wherein the lentiviral vector is a lentiviral 

9 transfer plasmid. 

10 80. The composition of claim 74, wherein the lentiviral vector is an infectious 

1 1 lentiviral particle. 

12 81. The lentiviral vector of claim 74, wherein: 

13 the shRNA or siRNA comprises a base-paired region approximately 1 9 

14 nucleotides long. 

1 5 82. The lentiviral vector of claim 74, wherein: 

16 the shRNA or siRNA comprises a base-paired region and at least one 

1 7 single-stranded overhang. 

18 83 . The lentiviral vector of claim 74 wherein: 

19 the siRNA or shRNA comprises a 3' overhang consisting of at least 

20 two pyrimidines. 

21 84. The lentiviral vector of claim 83, wherein the 3' overhang is UU. 

22 85 . The lentiviral vector of claim 74, wherein: 

23 the shRNA or siRNA comprises a region that is precisely 

24 complementary with a region of the target transcript. 

25 86. The lentiviral vector of claim 74, wherein the siRNA or shRNA is present at a 

26 level sufficient to reduce the level of the target transcript or its encoded 

27 protein by at least about 2 fold. 
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1 87. The lentiviral vector of claim 74, wherein the siRNA or shRNA is present at a 

2 level sufficient to reduce the level of the target transcript or its encoded 

3 protein by at least about 5 fold. 

4 88. The lentiviral vector of claim 74, wherein the siRNA or shRNA is present at a 

5 level sufficient to reduce the level of the target transcript or its encoded 

6 protein by at least about 10 fold. 

7 89. The lentiviral vector of claim 74, wherein the siRNA or shRNA is present at a 

8 level sufficient to reduce the level of the target transcript or its encoded 

9 protein by at least about 25 fold. 

10 90. The lentiviral vector of claim 74, wherein the lentiviral vector comprises: 

11 (i) a functional packaging signal; 

12 (ii) a multiple cloning site (MCS); and 

13 (iii) at least one additional element selected from the group consisting 

14 of: a second MCS, a second MCS into which a heterologous promoter or 

15 promoter-enhancer is inserted, an HIV FLAP element, an expression- 

16 enhancing posttranscriptional regulatory element, a target site for a site- 

17 specific recombinase, and a self-inactivating (SIN) LTR. 

18 91. A composition comprising: 

19 the lentiviral vector of claim 74; and 

20 a delivery agent that enhances delivery of the vector to cells. 

21 92. A pharmaceutical composition comprising: 

22 the lentiviral vector of claim 74; and 

23 a pharmaceutically acceptable carrier. 

24 93. A three plasmid lentiviral expression system comprising (i) a lentiviral transfer 

25 plasmid comprising a heterologous nucleic acid operably linked to a promoter, 

26 so that transcription of the heterologous nucleic acid produces one or more 

27 RNAs that self-hybridize or hybridize with each other to form an shRNA or 

28 siRNA targeted to a target transcript; (ii) a packaging plasmid; and (iii) an 

29 Env-coding plasmid. 
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1 94. A four plasmid lentiviral expression system comprising the three plasmid 

2 lentiviral expression system of claim 93, further comprising a fourth plasmid 

3 comprising a nucleic acid segment that encodes Rev, operably linked to a 

4 promoter. 

5 95. A method of inhibiting or reducing the expression of a target transcript in a 

6 cell comprising delivering the lentiviral vector of claim 74 to the cell. 

7 96. The method of claim 95, wherein the lentiviral vector comprises: 

8 (i) a functional packaging signal; 

9 (ii) a multiple cloning site (MCS); and 

10 (iii) at least one additional element selected from the group consisting 

11 of: a second MCS, a second MCS into which a heterologous promoter or 

12 promoter-enhancer is inserted, an HIV FLAP element, an expression- 

13 enhancing posttranscriptional regulatory element, a target site for a site- 

14 specific recombinase, and a self-inactivating (SIN) LTR . 

15 97. The method of claim 95, wherein the cell is a mammalian cell. 

16 98. The method of claim 95, wherein the cell is a primary cell. 

17 99. The method of claim 95, wherein the primary cell is a T cell. 

18 1 00. The method of claim 95, wherein the cell is a non-dividing cell. 

19 101 . The method of claim 95, wherein the cell is an embryonic stem cell. 

20 1 02. The method of claim 95, wherein the cell is a single-cell embryo. 

21 103. The method of claim 95, wherein the lentiviral vector is a lentiviral transfer 

22 plasmid. 

23 104. The method of claim 95, wherein the lentiviral vector is an infectious lentiviral 

24 particle. 
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1 105. The method of claim 95, wherein the ribonucleic acid comprises 

2 complementary regions that self-hybridize to form a short hairpin RNA 

3 targeted to the transcript. 

4 1 06. A method of reversibly inhibiting or reducing expression of a target transcript 

5 in a cell comprising steps of: 

6 (i) delivering a lentiviral vector to the cell, wherein presence of the 

7 lentiviral vector within the cell results in synthesis of one or more RNAs that 

8 self-hybridize or hybridize with each other to form an shRNA or siRNA that 

9 inhibits expression of the target transcript, wherein the lentiviral vector 

10 comprises a nucleic acid segment located between sites for a site-specific 

1 1 recombinase, which nucleic acid segment provides a template for transcription 

12 of the one or more RNAs; and (ii) inducing expression of the site-specific 

13 recombinase within the cell, thereby preventing synthesis of at least one of the 

14 RNAs. 

15 107. The method of claim 105, wherein the cell is a mammalian cell. 

16 108. The method of claim 105, wherein the recombinase is Cre and the sites are 

17 loxP sites. 

18 109. The method of claim 105, wherein the lentiviral vector is a lentiviral transfer 

19 plasmid. 

20 110. The method of claim 105, wherein the lentiviral vector is a lentiviral particle. 

21 111. The method of claim 105, wherein the lentiviral vector provides a template for 

22 synthesis of an RNA comprising complementary portions that hybridize to 

23 form an shRNA. 

24 1 12. A method for reversibly inhibiting or reducing expression of a transcript in a 

25 mammal in a cell type or tissue-specific manner comprising: 

26 (i) delivering to the mammal a lentiviral vector whose presence within 

27 a cell results in synthesis of one or more RNAs that self-hybridize or hybridize 

28 with each other to form an shRNA or siRNA that inhibits expression of the 
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1 target transcript, wherein the lentiviral vector comprises a nucleic acid 

2 segment located between sites for a site-specific recombinase, which nucleic 

3 acid segment provides a template for transcription of the RNA; and 

4 (ii) inducing expression of the site-specific recombinase in a subset of 

5 the cells of the mammal, thereby preventing synthesis of at least one of the 

6 RNAs within the subset of cells. 

7 113. The method of claim 112, wherein the recombinase is Cre and the sites are 

8 loxP sites. 

9 114. The method of claim 112, wherein the lentiviral vector is a lentiviral transfer 

10 plasmid. 

11 115. The method of claim 1 12, wherein the lentiviral vector is a lentiviral particle. 

12 116. The method of claim 112, wherein the lentiviral vector provides a template for 

1 3 synthesis of an RNA comprising complementary portions that hybridize to 

14 form an shRNA. 

15 117. A method of treating or preventing infection by an infectious agent, the 

16 method comprising steps of: 

17 administering to a subject prior to, simultaneously with, or after 

18 exposure of the subject to the infectious agent, a composition comprising an 

19 effective amount of a lentiviral vector, wherein presence of the lentiviral 

20 vector in a cell results in synthesis of one or more RNAs that self-hybridize or 

21 hybridize with each other to form an shRNA or siRNA that is targeted to a 

22 transcript produced during infection by the infectious agent, which transcript 

23 is characterized in that reduction in levels of the transcript delays, prevents, or 

24 inhibits one or more aspects of infection by or replication of the infectious 

25 agent. 

26 118. The method of claim 117, wherein the lentiviral vector provides a template for 

27 synthesis of an RNA that comprises complementary portions that hybridize to 

28 form an shRNA. 
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1 1 19. A method of treating or preventing a disease or clinical condition, the method 

2 comprising: 

3 removing a population of cells from a subject at risk of or suffering 

4 from disease or clinical condition; 

5 engineering or manipulating the cells to contain an effective amount of 

6 an siRNA or shRNA targeted to a transcript, which transcript is characterized 

7 in that its degradation delays, prevents, or inhibits one or more aspects of the 

8 disease or clinical condition; and 

9 returning at least a portion of the cells to the subject. 

10 120. The method of claim 119 wherein: 

1 1 the engineering or manipulating step comprises introducing a lentiviral 

12 vector into the cells, wherein presence of the lentiviral vector in a cell results 

13 in synthesis of one or more RNAs that self-hybridize or hybridize with each 

14 other to fomi an shRNA or siRNA targeted to the transcript. 

15 121. The method of claim 119, wherein: 

16 the cells comprise stem cells. 

17 1 22 . The method of claim 121, wherein: 

18 the stem cells are peripheral blood stem cells. 

19 123. The method of claim 119, further comprising: 

20 expanding at least a portion of the cells in culture. 

21 124. A kit comprising (a) a lentiviral transfer plasmid comprising a nucleic acid 

22 sequence including (i) a functional packaging signal; (ii) a multiple cloning 

23 site (MCS) into which a heterologous gene may be inserted; and (iii) at least 

24 one additional element selected from the group consisting of: a second MCS, 

25 an HIV FLAP element, a heterologous promoter, a heterologous enhancer, an 

26 expression-enhancing posttranscriptional regulatory element, a target site for a 

27 site-specific recombinase, and a self-inactivating (SIN) LTR; and one or more 

28 of the following items: (b) a packaging mix comprising one or more plasmids 

29 that collectively provide nucleic acid sequences coding for retroviral or 
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1 lentiviral Gag and Pol proteins and an envelope protein; (c) cells (e.g., a cell 

2 line) that are permissive for production of lentiviral particles such as 293T 

3 cells; (d) packaging cells, e.g., a cell line that is permissive for production of 

4 lentiviral particles and provides the proteins Gag, Pol, Env, and, optionally, 

5 Rev; (e) cells suitable for use in titering lentiviral particles; a transfection- 

6 enhancing agent such as Lipofectamine; (f) a selection agent such as an 

7 antibiotic, preferably corresponding to an antibiotic resistanc gene in the 

8 lentiviral transfer plasmid; (g) instructions for use; (h) a positive control 

9 plasmid. 
10 
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Figure 18 
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Figure 19 
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GFP 



A 50% confluent 1 0cm plate of D7 cells (Bear et ai. 2000), was infected with 1 0Oul 
of concentrated pLL3.7 B-catenin lentivirus, which expressed GFP as a transgene 
between two LoxP sites. Infected cells were sorted based upon expression of 
EGFP (A, green line). A 50% confluent 6cm-plate of sorted D7 pLL3.7 b-catenin 
cells was infected with adenovirus expressing the Cre recombinase. 1x1 0 A 5 
infectious units were used in the infection. Cells were expanded for 10 days to 
allow for expression of Cre protein, deletion of lox-CMVgfp-lox, and depletion of 
EGFP protein pools. Cells were then analyzed by flow cytometry for expression of 
EGFP (B, pink line). Cells were also sorted based upon loss of EGFP expression 
and expanded. Purple solid peak in A and B represent uninfected control. 
Percentage GFP + cells are shown on each plot. A direct comparison between 
pLL3.7 infected D7 cells before (green line) and after (pink line) Cre delivery is 
seen in C. 
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Figure 23 
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Figure 25 
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